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ABSTRACT 

We explore the dependence of galaxy stellar population properties that are derived 
from broad-band spectral energy distribution (SED) fitting - such as age, stellar mass, 
dust reddening, etc. - on a variety of parameters, such as star formation histories, 
age grid, metallicity, initial mass function (IMF), dust reddening and reddening law, 
filter setup and wavelength coverage. Mock galaxies are used as test particles. We 
confirm our earlier results based on real z = 2 galaxies, that usually adopted r-models 
lead to overestimate the star formation rate and to underestimate the stellar mass. 
Here, we show that - for star-forming galaxies - galaxy ages, masses and reddening, 
can be well determined simultaneously only when the correct star formation history is 
identified. This is the case for inverted-r models at high-z, for which we find that the 
mass recovery (at fixed IMF) is as good as ~ 0.04 dex. However, since the right star 
formation history is usually unknown, we quantify the offsets generated by adopting 
standard fitting setups. Stellar masses are generally underestimated, which results 
from underestimating ages. For mixed fitting setups with a variety of star formation 
histories the median mass recovery at z ~ 2 — 3 is as decent as ~ 0.1 dex (at fixed 
IMF), albeit with large scatter. The situation worsens towards lower redshifts, because 
of the variety of possible star formation histories and ages. At z ^ 0.5 the stellar 
mass can be underestimated by as much as ~ 0.6 dex (at fixed IMF). A practical 
trick to improve upon this figure is to exclude reddening from the fitting parameters, 
as this helps to avoid unrealistically young and dusty solutions. Stellar masses are 
underestimated by a smaller amount (~ 0.3 dex at 2 ^ 0.5). Reddening and the star 
formation rate should then be determined via a separate fitting. As expected, the 
recovery of properties is better for passive galaxies, for which e.g. the mass can be 
fully recovered (within ^ 0.01 dex at fixed IMF) when using a fitting setup including 
metallicity effects. In both cases of star-forming as well as passive galaxies, the recovery 
of physical parameters is dependent on the spectral range involved in the fitting. We 
find that a coverage from the rest-frame UV to the rest-frame near-IR appears to be 
optimal. We also quantify the effect of narrowing the wavelength coverage or adding 
and removing filter bands, which can be useful for planning observational surveys. 
Finally, we provide scaling relations that allow the transformation of stellar masses 
obtained using different template fitting setups and stellar population models. 

Key words: galaxies: general - galaxies: evolution - galaxies: formation - galaxies: 
fundamental parameters 



1 INTRODUCTION 

Over the last two decades numerous galaxy surveys 
were designed and performed to st udy galaxy f orma- 
tion and evolution, e.g. CANDELS (|Grogin et al.l I2OIII : 
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iKoekemoer et all I2OIII) CO MBO-17 (IWolf et al 



COSMOS JScoville et al.]|2007l '). DEEP2 JDavis et al.ll2003l') 
GMASS (|Kurke^_alJ |2009|) GOODS jDickinson et alj 



2001 



120031 ) ■ SDSS (|York et al.ll200ol ). SERVS (|Lacv ct al. 200^). 
To this end, the determination of the physical properties of 
galaxies, such as ages, stellar masses, star formation rates 
(SFRs), star formation mode, dust reddening, metallicity 
and - when necessary - photometric redshifts possibly as a 
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function of look-back time and environment is crucial. Even 
for galaxy surveys devo ted primarily to Cosmology , such 
as BOSS ( SchloEol ct aLl l2009l : lEisenstein et all 1201 ll). PES 
(http : //www. darkGncrgysurvcy.org/), LSST I Strauss et al.l 
l20ici ) and WiggleZ (Blake ct al. 200i), galaxy properties 
need to be accurately determined as gala xies bias the read- 
out of the cosmological signal (e.g. i White et al.ll2011f) . 
Galaxy properties are derived by fitting stellar population 
models to data, and several approaches ar e taken, which 
have pros and cons (fo r recent reviews see IWalcher et al.l 
I2OIII : iMarastonI l2011al ). For our purpose let us briefly 
distinguish between spectroscopic and photometric meth- 
ods. While the availability of spectra is mandatory for 
the determination of detailed chemical abundances, such as 
chemical abundance ratios, which is accomplished through 
using selected absorption line indic es and compar i ng to 



element-ratio sens i tive models (e.g. IWorthev et al. 
Trager et akl I2OO0I: iKuntschnd I2OO0I : iThomas et al 



1992 



2005 



Schiavonll2007l : iThomas et al.ll201oi r the high S/N that 



required made this method practical only for a limited num- 
ber of galaxies at limited redshifts so far. Also popular is 
the so-called full spectral fitting approach in which popu- 
lation par ameters are extracted by fitting the full available 



spectrum (I Heavens et a l. 2000 



Ocvirk et al. 



2006: 



Panter et al 



Cid Fernande s et al 
2007; Toieiro et al. 



2004 



20071 . 



2OO9I : IChen et al.ll2012l ') and which also requires high quality 



and high S/N spectra in order to succeed. These are often 
limited in wavelength coverage, notoriously time consuming 
and thus been limited so far in redshift and to small sam- 
ples. 

Fitting of broad-band photometry is the favourite approach 
at high redshift, because it is relatively cheap and yet - 
by covering a wide wavelength range as in the modern ap- 
proach (e. g. The Great Ob servatories Origins Deep Survey 
(GOODS) iDickinson et al.ll200S ) although at a coarser reso- 
lution compared to spectroscopy - can be effective in break- 
ing the age-metallicity degeneracy and in providing sensible 
physical properties for galaxies. It is common to exploit such 
data in spectral energy distribution (SED) fitting and many 
conclusions on galaxy evolution are based on these results. 
Hence, the interesting question is to assess how well galaxy 
properties can effectively be recovered with this method, 
and many papers have been devoted to this scope over re- 
cent years. The first papers that addressed the dependency 
of the derived galaxy properties at high redshift on the in- 
put st ellar population models a re lMaraston 1120051. he r eafter 
M05), Ivan der Wei et all (|2006l ') andlMa raston et al.1 (|2006l . 
hereafter M06). These works consistently found that stel- 
lar ages and masses, of observed objects are - not surpris- 
ingly - a function of the stellar evolution pattern adopted in 
the m odels used to derive them (for a review see IMarastonI 
l2011bl ). This information is useful, but does not yet inform 
us on which fitting method provides the correct galaxy pa- 
rameters. On the other hand, using mock galaxies for which 
the properties are known a priori allows one to understand 
the effectiveness of the assumptions made in the fitting, 
even though there is no guarantee that simulated galax- 
ies have star formation histories similar to the real ones 
in the Universe. In addition, different parameters can be 
recovered with different accuracies. For example it is well 
known that stellar m asses are generally b e tter recovered 
than ages and SFRs (iPapovich et al.l I2OOII : IPozzetti et al.l 



20071 : iBolzonella et al.ll2009l : iLee et aLJlioogJ : iMaraston et~aLl 



2OIOD 



In recen t years there have b e en sev e ral works in t his di - 



rection. 



- iLonghetti fc Saracc3 l|2009l ). IWuvts et all (|2009l ) 

and iLee et al.l l|2009r ) address the dependency of the 
results on fitt ing parameters using simulated galaxies. 
ILonghetti fc Saracco (2009 ) study the dependence of stellar 
mass estimates on a variety of models and their parameters 
like age, metallicity, IMF and star formation history (SFH) 
for early-type galaxies at 1 < 2 < 2. They assume differ- 
ent stellar population synthesis cod es for stellar evolution 
modeUing (|Bruzual fc Char loti [200^ . hereafter BC03; M05; 
and various others) . They simulate early-type galaxies using 
BC03 stellar population models assuming exponentially de- 
clining star formation rates (with timescale r). Furthermore, 
they superimpose secondary star bursts with a smaller value 
of r at later epochs. They find that stellar masses can not be 
recovered be t ter th an a factor of 2-3, at fixed known IMF. 
IWuvts et al.l (|2009l ) use mock galaxies from a hydrodynami- 
cal merger simulation based on BC03 models. They simulate 
all types of galaxies, but focus on a small redshift interval 
between 1.5 and 3 and also do not explore metallicity ef- 
fects. They conclude that properties of spheroidal galaxies 
(which means galaxies with little star-formation) are gener- 
ally well reproduced, whereas those of star-forming galaxies 
suffer from large uncertainties. 

I Lee et al.l ((20091) focus on mock Lyman break galaxies at 
z ~ 3.4, 4 and 5 based on BC03 stellar population models. 
They find that masses and SFRs tend to be underestimated 
and mean ages overestimated. They attribute these effects 
to the difference in SFH between the mock s and the r - mode l 
template used in the fitting. Furthermore, I Lee et al.l (|2009l ) 
show that a large wavelength coverage from optical to rest- 
frame near-IR is required to best recover the input physical 
parameters. 

In this paper we follow the same approach as these works. 
We perform our investigation by using both mock star- 
forming as well as passive galaxies, for which the physical 
properties - age, metallicity, reddening, mass and star for- 
mation rate - are well defined. We then treat these mocks 
as observed data and SED-fit them with several different 
setups. The comparison between input properties and those 
derived from the fitting allows us to understand and quan- 
tify the robustness of the derived properties as a function of 
the fitting parameters. 

Our work is complementary to these earlier works on sev- 
eral aspects. Firstly, it relies on the M05 stellar population 
models both as ingredient of the fitting and of the mock 
galaxies, but also explores the case of using different stel- 
lar population models in the mock and in the fitting. Also, 
a wider redshift span is considered, down to redshift 0.5, 
and a larger set of parameters in terms of metallicity and 
reddening prescriptions. Moreover, a detailed exploration of 
the effects of wavelength coverage and photometric filters is 
performed. This latter analysis aims at providing a practi- 
cal guide for planning observational surveys and proposals. 
Finally, we provide scaling relations between properties ob- 
tained with different fitting setups and population models, 
which will help unifying results obtained from different data 

sets and using different modelling. 

This work complements a recent paper IMaraston et al.l 
(|20ld . hereafter MIO) in which we focused on real 2; ~ 2 
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star-forming galaxies from the GOODS survey and stud- 
ied how well - or how bad - star formation histories that 
are typically adopted for the fitting templates are able to 
recover the galaxy star formation rates, stellar masses and 
reddening. One of the main conclusions of this work is that 
- because of the overshining effect from massive stars even 
if present in small proportions - essentially no star forma- 
tion history is able to provide an accurate determination of 
star formation rates, reddening and masses if the age is left 
as a free parameter in the fit. This happens because the fit 
tries to match the light from the most massive stars, which 
is best accomplished with a low age. As a main conclusion 
a template was found that was able to recover the galaxy 
properties very well, which is an exponentially increasing 
star formation - that we named inverted-r models as op- 
posite to the usually assumed declining star formation or 
r-model but with two important priors, namely: i) that the 
starting of star formation was at a much earlier epoch with 
respect to the time of observation (z ~ 5); ii) that the typi- 
cal timescale for star formation (r) could not be too short, 
namely r > 0.3 Gyr. This conclusion is able to provide in- 
formation on the actual star formation mode of high redshift 
star-forming galaxies, which triggers a lot of astrophysical 
consequences. 

In the present paper we do not pursue the aim at unveil- 
ing the galaxy star formation mode. Rather, we quantify 
the galaxy property offsets that are obtained with usually 
adopted templates expanding upon the investigation of MIO 
by exploring a wider parameter space in terms of reddening, 
metallicity, etc.. 

For real passive galaxies at z ~ 2, M06 found that a fitting 
setup including a wide range in metallicities, star forma- 
tion histories and reddening laws was optimal as the derived 
best-fitting star formation histories were able to match the 
observed strength of the Mg2800 line-strength in the galaxy 
spectra, which was regarded as a test independent of the 
photometric SED-fitting. In the present work we test the 
same fitting setup on mock passive galaxies and we reach 
similar conclusions. 

The paper is organised as follows. We introduce the mock 
galaxy samples in sections [2l The method of SED-fitting and 
the different fitting setups are described in section[3l Results 
and discussion are presented in section [l] The comparison 
of our results with the literature is carried out in section [S] 
In section [6] we provide the scaling relations. We summarize 
our work in section [T] 

Throughout the paper we use a standard cosmology of 
Ho = 71.9 km/s/ Mpc. ^a = 742 a nd Qm = 0.258 as 
used in GallCS fsee lHatton et al.ll2003l ) and magnitudes in 
the Vega system. 



2.1 Star- forming galcLxies from a semi-analytic 
model 

We extract model galaxies from the hybrid hierarchical 
model GallCS (Hatton ct al. 2003). This uses dark matter 
merger trees obtained with an N-body simulation covering 
the redshift range ~ 35 > z = 0. Baryonic physics is im- 
plemented semi-analytically following standard recipes (for 
details about GallCS see Hatton et al. 2003 and for a re- 
view see Baugh 2006). The latter includes gas cooling, star 
formation, chemical enrichment, feedback from supernovae 
and AGN and treats the dynamical interactions of galaxies 
in clusters and groups, such as tidal stripping, dynamical 
friction and mergers. 

The mass distribution inside each single galaxy is governed 
by its evolution. Cooling of hot gas into the dark matter 
potential wells forms disks where star formation occurs de- 
pending on the mass of the available cold gas and on the 
galaxy dynamical time with the star formation efficiency as 
a free parameter. Secular evolution in disks and mergers pro- 
duces bulges at the centres of disks. 

For each galaxy and its components (disk, bulge, burst) sep- 
arately the code produces rest-frame spectral energy distri- 
butions. We use the GallCS version in which these are based 
on the S alpeter IMF ( SalpeteJIlQSSh M05 stellar population 
models l|Tonini et al.ll2009l , l2010l ). We then apply d u st red - 



2 MOCK GALAXIES 

In the following we describe the samples of mock galaxies 
used to carry out our analysis. A summary of their properties 
is given in Table [T] 



dening to the total spectra. Following iTonini et al.l l)2009l l. 
we adopt a Calzetti extinction curve and a colour-excess 
E{B — V) proportional to the galaxy star formation rate, 
parameterized as E{B - V) = 0.33 • {Log{SFR) -2) + 1/3 
and E{B-V) = for SFRs less than 10 M^/yr. This param- 
eterization is based on the reddening derived for a sample 
of real star-formin g galaxies at z ~ 2 in the GOODS fields 
(JDaddi et al.ll2007l . MIO). 

We do not randomize the inclination of disk galaxies (which 
reduces the dust effect in face-on objects) but we redden the 
spectrum by the total amount of extinction calculated for 
each galaxy, theref ore considering the m aximal reddening for 
each object. As in lTonini et al.l (|2010|) the mock catalogues 
are built by redshifting the (unreddened and reddened) spec- 
tra in each simulation timestep to the corresponding redshift 
value, obtaining the observer's frame spectra. We then fil- 
ter the spectra with a chosen set of filters to obtain the 
observer's frame broad-band magnitudes. Finally, we scat- 
ter the magnitudes with Gaussian errors with 3 a errorbars 
where a — 0.1. In the fitting we adopt errorbars typical of 
those of the COSMOS survey l|Capak et al.ll2007^ for mock 
galaxies at redshifts ^ 1 (~ 0.3 mag for near-IR bands and 
~ 0.05 mag otherwise) and th ose of GOODS star-forming 
objects (|Giavalisco et al.ll2004r i for mock galaxies at redshift 
> 2 (~ 0.1 mag). A minimum photometric error of 0.05 mag 
is applied in the fitting when the given photometric error in 
this band is smaller than 0.05 mag. We use a sample of 100 
mock galaxies in both flavours - with dust reddening added 
as described above and without dust reddening - at each of 
the five redshifts z=0, 0.5, 1, 2 and 3, respectively. 
The fitting method is described in section |31 
The stellar population properties of the mock galaxies of 
the full merger tree are shown in conjunction with those 
of the 100 galaxy sample used in this paper in Figs. [T]|31 
The mass-weighted ages of galaxies at z~0, 0.5 and 1 are 
confined around a few Gyr (Fig. [ij left). However, star for- 
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Figure 1. Left: Mass-weighted ages of mock star-forming galaxies at redshifts 0, 0.5, 1, 2 and 3 for the entire merger tree (black) and 
the sub-sample used in this paper (red). Right: Age distribution. 



mation is never completely shut off in semi-analytic models 
such that low redshift objects still contain a small fraction of 
populations that are much younger than the mass-weighted 
age. This is shown in Fig. [T] right hand panel. Therefore, 
the intrinsic stellar populations of these galaxies cover a 
much broader age range than those at higher redshift. This 
will have consequences on the capability of simple compos- 
ite models to recover the actual properties of these galax- 
ies (see section |4|. Mass- weighted metallicities increase with 
redshift, but hardly reach half-solar metallicity by z~0 (Fig. 
[5]). Note, that the latest formed populations have indeed so- 
lar and super-solar metallicities but their contribution to the 
total stellar mass is very small. At high redshift, the mass- 
weighted ages of most galaxies are younger than 1 Gyr, and 
much more similar one to each other with respect to those 
at low redshift. As expected, metallicities are lower. 
As a further illustration we show the typical star formation 
histories of galaxies at redshift 0.5 and 2 in Fig. [S] Galaxies 
at redshift 2 show predominantly increasing star formation 
rate while for most galaxies at redshift 0.5 the star forma- 
tion rate decreases with time. Finally, star formation rates 
are shown as a function of stellar mass in Fig. 3] The number 
of low mass objects decreases towards z=0.5. Most objects 
have very low star formation rates. 

It is important to notice that - from the stellar population 
properties point of view - galaxies at z=2 and 3 and galax- 
ies at z=0.5 and 1 are similar, and with regard to various 
assumptions in the fitting. The wavelength coverage is ob- 
viously redshift sensitive and conclusions may be different. 
Note that we do not show the z~0 objects further because 
they are more easily accessed by other diagnostic methods 
such as spectroscopy, and we shall study them in a separate 
publication. 
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Figure 2. Mass-weighted metallicities of the mock star-forming 
galaxies at redshifts 0, 0.5, 1, 2 and 3 for the entire merger tree 
(black) and the sub-sample used in this paper (red). 



2.2 Passive galaxies 

Star formation in the semi-analytic model is never com- 
pletely shut off. Therefore, each galaxy will always contain 
young and bright stars although their contribution to the 
total mass may be very small. In order to study the case 
of truly passive and old galaxies without any young stars, 
we consider a set of solar metallicity, Salpeter IMF, simple 
stellar populations of M05 and place them at the various 
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Table 1. Properties of mock star-forming and mock passive galaxies. Values given for mock star-forming galaxies refer to the sample 

used in this paper, not to the entire merger tree. 

^ This reflects the range in ages of the oldest population. 

^ This reflects the range in mass-weighted metallicities. 

^ E{B -V) = 0.33 ■ (Log{SFR) - 2) + 1/3 and E{B - y) = for SFRs less than 10 Mo/yr for mock star-forming galaxies. No internal 

dust reddening was applied to the mock passive galaxies. 

■* Observed K-band magnitudes from unreddened spectra after application of randomisation with magnitude errors. For mock passive 

galaxies the brightest magnitude refers to the youngest and most massive object and the faintest to the oldest and least massive object 

at the given redshift. 
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Figure 3. Selection of star formation histories of semi-analytic galaxies at z=0.5 and 2. The zeropoint of the lookback time lies at the 
given redshift. Galaxy star formation rates at z = 2 increase, while those at 2 = 0.5 predominantly decline. Galaxies at redshift 2 are 
younger than those at redshift 0.5 (see Fig.[T}. 



redshifts. We treat the photometry of mock passive galaxies 
in a similar way to that of mock star-forming galaxies, i.e. 
the observer's frame spectra are convolved with broad-band 
filter bands and the resulting broad-band magnitudes are 
scattered with Gaussian errors with 3 a errorbars where 
(T = 0.1 mag. Photometric errors in the fitting are set to 
0.1 mag. The ages of mock passive galaxies range from 1 
to 7 Gyr a,t z — 0.5 and from 1 to 1.5 Gyr at 2 = 3 (see 
Table [T] for details). Stellar masses vary between 10^"'^ and 



10^^ Mq in steps of 0.25 dex. Internal reddening was not 
added to these passive galaxies. 

The choice of a SSP as star formation history for passive 
galaxies at high redsh ift might not be very realistic and 
indeed studies by e.g. ICimatti et al.l (2008) indicate that 
short-lived starbursts with duration times of ~ 0.1 Gyr 
are required in order to reproduce some of the observed 
early-type galaxies at 1 < 2 < 2. However, a SSP reflects 
the most extreme case which in conjunction with the 
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Cloud by iFitzpatrick' (l986^, Small Magellanic Cloud by 
IPrevot et al. (1984). Ca lzetti's law for local starburst galax- 
ies ( Calzetti et al.ll2000l l and no reddening. 



10 11 

log M*G.i,cs/Mo 

Figure 4. Stellar masses and star formation rates of mock star- 
forming galaxies at redshifts 0.5, 1, 2 and 3 for the entire merger 
tree (black dots). The sub-sample used in this paper is marked 
in red. In the bottom right of each panel the build-up of the red 
sequence towards lower redshift is visible. 



mock star-forming galaxies enclose cases in which star 
formation might have lasted a short time before passive 
evolution set in. Thus the case of real passive galaxies at 
high re dshift will lie in between th ese two boundaries. We 
refer to iLonghetti fc Saraccd ()2009l ) for the study of mock 
passive galaxies with exponentially declining star formation 
histories. 



3 SPECTRAL ENERGY DISTRIBUTION 
FITTING 

Spectral Energy Distribution (SED) fitting is based on the 
comparison between theoretical template spectra and ob- 
served broad-band magnitudes. The SED-fitting is carried 
out in the same way as described in M06 and MIO. We use a 
modified version of the HyperZ -code (BolzoncUa et al. 200Q) 
in which redshift can be fixed at its spectroscopic value for 
each object {HyperZspec, M. Bolzonella, private communi- 
cation). The fitting method is based on a x^-minimisatiorij. 
The xt is computed for templates covering a broad range 
of star formation modes, ages and metallicities. The combi- 
nation of parameters that provides the minimum x^ is the 
best-fit solution. 

HyperZspec provides various empirically-derived la ws to 
treat galaxy internal reddening: Mi lky Way by lAUenl 
l|l976t ). Milky Way by ISeatonI ll979i ). Large Magellanic 



X^ is defined as x^ = 2_/ 



-bxFt^ 



in which 

F'obs i S'Ud Ftemp,i are the observed and template fluxes in fllter i, 
CTi is the photometric uncertainty and b is a normalisation factor. 
The reduced xi is — with u as the number of degrees of freedom. 



Galaxy ages are constrained to be younger than the age of 
the Universe at each redshift. 

The best fit solution yields the galaxy age, star formation 
law, metallicity and dust reddening. The age is defined as 
the time elapsed since the onset of star formation, which is 
therefore the age of the oldest stellar population present in 
the template. The normalisation between best fit template 
and observed SED allows us to calculate stell ar mass and 
star f ormation rate using codes developed in iDaddi et al.l 
(l2005h and M06. 

SED templates are based on the simple and composite stel- 
lar population models of M05. In the following subsections 
we describe the variety of template and filter setups in use. 



3.1 Model Template Setups 

3.1.1 Wide template setup 

We a dopt a mixed template setup as in iMaraston et al.l 
(|2006l . hereafter wide setup) as default setup. It consists of 
32 types of theoretical spectra which cover a wide range of 
star formation histories - specifically: i) SSP (single star- 
burst), ii) exponentially declining SFR (so-called r-model 
with T — 0.1,0.3 and 1 Gyr), iii) truncated SFR (constant 
star formation for a time t with t = 0.1,0.3 and 1 Gyr, 
no star formation afterwards) and iv) constant SFR - and 
metallicities oi I Zq, \Zq, Zq and 2 Zq . A Salpeter IMF 
is assumed, consistent with the mock galaxies, but we also 
explore different IMFs. 

Despite the increase in CPU time that is required to fit 
data with the wide setup, it was found by M06 that this 
approach gives the most accurate results for the sample of 
passive galaxies in their study. For example, as an indepen- 
dent constraint, they find that predictions for the Mgc/v line 
strength from the best SED-fit agree very well with the ob- 
serv ed line strengths in the GOODS-S sample IjDaddi et al.l 
[200a). 

However, as well known, passive galaxies are easier to fit 
than star-forming ones and also metallicity effects are more 
evident in passive spectra. Hence, the result of M06 may not 
be confirmed for other classes of galaxies. 
In order to study the effect of metallicity, we also obtain 
best fit solutions using the wide setup in mono-metallicity 
form. Furthermore, we retrieve separate solutions for the full 
setup in combination with different reddening laws to reveal 
their effect on the fitting. 

We also consider an identical setup based on BC03 models 
(hereafter BC03 setup). It is well known that these models 
mostly differ from the M05 model s in the energetic s of the 
TP-AGB and at the young ages (|Marastonll2011al '). From 
this viewpoint, these two stellar population models repre- 
sent two extremes whic h enclose other ones , e.g. Chariot 
fc Br uzual (CB07), (e.g. ICimatti et al.ll2008l : IConrov et"aLl 
[2OO9I). As BC03 models are widely used in the literature 
using them for the same set of mock galaxies allows us to 
provide scaling relations which help translating the results 
obtained with BC03 models to M05 models and vice-versa. 
Results are summarised in Appendix [Cl 
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3.1.2 Only-T setup 

Exponentially declining SFH or r-type models are a com- 
mon choice in the literature (e.g. Pa povich ct al. 200ll : 
IShaplev et alll2005l : [Pozzetti et al.[[2007l : lllbert et al.ll201ol ). 
In MIO we showed that for real star-forming galaxies as well 
as mock star-forming galaxies at z=2 r-models do not pro- 
vide a good approximation of the star formation history. 
Here, we try to quantify the effect of this mismatch by adopt- 
ing a template setup which contains r-models (only-r setup, 
with r = 0.01,0.05,0.1,0.2,0.5,1,2 and 5 Gyr) and con- 
stant star formation , solar metallicity and a Salpeter IMF 
as in IShaplev etl l' ('2005) (but based on M05) . Further- 
more, IShaplev et al. (_2005) used a Calzetti type reddening 



law and models from BC03. We explore these templates by 
exploiting various reddening laws. 



these are affected by overshining; our mock passive galaxies 
are by definition devoid of thi^J. 



3.1.5 Varying the IMF 

A further interesting question to address is the sensitivity of 
the SED-fit to IMF choices as the IMF is a free parameter 
in population synthesis models. Commonly a conversion fac- 
tor between Salpeter and other IMFs is assumed to convert 
stellar masses and other quantities, for example masses ob- 
tained with a Kroupa IMF are smaller by a factor of ~ 1.6 
when compared to Salpeter (M06). Here, we have the op- 
portunity to assess whether a simple scaling is sufficient. 
We use the wide setup with templates bas ed on a variet y 
of differen t IMFs, namely Salpeter, Kroupa l|Kroupall200in . 
Chabrier (|ChabrieiJl2003l ) and a top- heavy IMF (with slope 
a:: ~ 0). For the C habrier and top-heavy IMFs we adopt the 
slopes as given in Ivan Dokkum (2008). 



3.1.3 Inverted-T models 

Exponentially increasing SFHs (so-called inverted-r models) 
were put forward in MIO. It was found there that - with 
priors on the galaxy formation redshift and the typical r - 
these models were the only ones able to recover the SFR 
and the reddening that was determined from independent 
indicators such as far-IR luminosities or the UV-corrected 
slope. In MIO we also used a set of mocks from this paper 
and show that the same setup also gave excellent results on 
the SFR-mass-relation of semi-analytic galaxies. 
In this paper we extend the test to mocks at other redshifts, 
using the same assumptions, namely a formation redshift of 
~ 5, which translates into ~ 1 Gyr age models for fitting 
~ 3 galaxies and ~ 4 Gyr age models for 2 < 2. 



3.1.4 Age Grids 

We also investigate intrinsic properties of the templates 
themselves, in particular the age grid. Templates normally 
get rebinned from 221 to 51 values in age within the Hy- 
per Z spec-code. The original age grid is equally spaced in 
logarithmic space. Hence, the rebinning has a larger effect 
on younger ages. The latest version of the code considers 221 
ages without rebinning. In both cases minimum and maxi- 
mum age are 100,000 yrs and 20 Gyr, respectively. We adopt 
the finer age grid of 221 ages as default and test the effects 
of using the wide setup with a rebinned age grid. 
Furthermore, we study the effect of excluding very young 
ages in the fitting by limiting the minimum age to 20 
Myr and 100 Myr. We showed in MIO that solutions ob- 
tained with the exclusion of the youngest ages are less 
biased by very young and bright stellar populations that 
dominate the emitted light, thus disabling their outshin- 
ing effect. Similar min imum age constraints are applied by 
iBolzonella et al.l (|2009l ) and lWuvtset all (|200i) in order to 
exclude very young solu ti ons a nd to improve the mass es- 
timate. IBolzonella et al.l (120091 ) verified this through tests 
with simulations. 
Age grids are only varied for mock star-forming galaxies as 



3.2 Wavelength range included in the fitting 

Which wavelength coverage is required in order to gain 
robust results from SED-fitting? How many broad-band 
filters should be used? These are key questions in galaxy 
evolution studies especially for the planning of galaxy 
surveys and observational proposals and for understanding 
the reliability of the results. In order to answer these 
questions we carry out a comprehensive test of various 
filter setups and their performance in recovering the 
galaxy physical properties. M06 discuss the importance of 
including the rest -frame near-IR in the fitting (illustrated 
in their Figure QV lKannappan fc Gawiserl (|2007h also point 
out the dependency of stella r mass est imates on filter 
type and wavelength coverage, lllbert et al.i (2010) also find 
differences in the derived stellar masses depending on the 
inclusion or exclusion of the rest-frame near-IR for galaxies 
at 2: > 1.5 and conclude t hat the rest-fr a me n ear-IR should 
be included in the fitting. IShaplev et al.l (|2005l ) on the other 
hand conclude that the near-IR does not carry additional 
information with respect to a fitting c arried out in the 
optical, using BC03 population models. Ivan der Wei et al.l 
(2006) even conclude - in combination with the M05 stellar 
population models - that including the near-IR could even 
be damaging the results. 

The light of a galaxy is a superposition of the light emitted 
by its stellar populations and the wavelength range in which 
a single stellar population emits most of its light changes 
with age. Hence, by sampling the entire spectrum there is 
generally a better hope to trace all different stellar popula- 
tions in a galaxy whereas restricting the filter set to certain 
wavelengths may result in over-emphasising the contribu- 
tions of specific populations. In order to better understand 
the extent of this effect, we explored the following filters sets: 

(i) UBVRI JHK IRAC (3.6, 4.5, 5.8 /im), UBVRI JHK, 
UBVRI JH, UBVRI J, UBVRI 
(ii) UBVRI IRAC 



^ We have verified that a minimum age constraint of 0.1 Gyr has 
no effect on the final results for the mock passive galaxies. 
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(iii) BVRI JHK IRAC, VRI JHK IRAC, RI JHK IRAC 

(iv) u' g' r' i' z' (SDSS) [z=0.5] 
(v) UBVRI Y JHK IRAC 
(vi) BRIK 

Obviously, the assumption that all mock galaxies are de- 
tected in all filter bands is an idealised case. In particu- 
lar, the majority the mock passive galaxies at 2: ^ 2 would 
not be detected in filter bands bluer than V (I for log 
M*^ 10.5) in even the deepest surveys, such as e.g. GOODS 
IJGiavalisco et al.ll2004l ). because of their old age and SFH. 
However, this case is addressed when filter setups that omit 
the bluest filter bands are used in the fitting. 
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3.3 Photometric uncertainties 

In order to quantify the effect of photometric uncertainties 
we use two different catalogues of the same 100 galaxies at 
z — 0.5. One catalogue contains the magnitudes that were 
scattered with Gaussian errors as described in section [51 the 
other the true magnitudes without this scatter. For this ex- 
ercise we focus on the mock catalogues created from unred- 
dened spectra. We derive the stellar population properties 
with the exact same method using our default setup and no 
reddening in the SED-fitting. The exclusion of reddening in 
the catalogues and fitting allows us to single out the effect 
of the photometric uncertainties without being affected by 
the age-dust degeneracy. 

The influence of photometric uncertainties for the sample 
of mock passive galaxies is determined in the same way but 
using only a solar metallicity SSP as template to provide the 
exact match in star formation history to the mock passive 
galaxies. 



4 RESULTS 

In this section we compare the galaxy properties derived 
from SED-fitting of mock galaxies to the true values. The 
comparison is performed as a function of template setup 
(star formation history, metallicity, age grid, IMF, stellar 
population modeO), filter set and wavelength coverage, and 
reddening law. For the semi-analytic galaxies, we first pro- 
duced mock catalogues without dust reddening to better 
single out the contributions of the multiple stellar popula- 
tions building up the galaxies. For these unreddened galaxies 
we also do not consider reddening in the fitting. In the fol- 
lowing, we refer to this as the 'case without reddening' or 
the 'unreddened case'. Although ignoring the existence of 
dust when simulating star-forming galaxies is an unrealistic 
case it provides an insight into the impact of the age-dust 
degeneracy. The procedure is then repeated including dust 
reddening, both in the mock catalogue and the fitting. For 
the mock passive galaxies no internal dust reddening has 
been included, but both the cases of including and exclud- 
ing reddening in the fitting are studied. Table lAll gives an 
overview of all fits that were carried out. 
In order to evaluate the optimal template and filter sets - 



Figure 5. Minimum x^ of best fits with alternative setups in 
comparison to the wide setup, using UBVRIJHK-I-IRAC bands 
for mock star-forming galaxies at redshift 2. Black symbols show 
the un-reddened case (internal dust reddening is not included 
in the mock galaxies and the fitting is carried out without dust 
reddening) . Red symbols show the case with dust reddening. The 
same results are found at all other redshifts. 



intended as the one for which fitted and input properties are 
closest - we define a quality estimator Q similar to a x'^: 



Q 



/E(A)= 



with A = X 



i, fitting Xi^input: 



i being the number of objects and x representing age, metal- 
licity, reddening, stellar mass or SFR, respectively. The op- 
timal value of Q and A is zero. 

A first indicator of the individual performances of the var- 
ious template setups is the xi oi the best fit. We show re- 
sults for the mock star-forming galaxies at z=2 in Fig. [5] 
as an example. For nearly all objects the wide setup pro- 
vides a better xl in comparison to the only-r setup due to 
the wider template library with regard to metallicity and 
star formation histories. However, we showed in MIO that 
models with the smallest xt s-re not necessarily a better 
physical solution. This is further confirmed here by the fact 
that a BC03-based setup shows a better xt in some cases al- 
though the mock galaxies are based on the M05 models and 
therefore their SED is intrinsically differentjj. This example 
highlights again how misleading the mere consideration of 
the minimum xl can be. 

In the following we focus on the ability of different template 
setups and wavelength coverage to recover the age, metal- 
licity, SFH, reddening and stellar mass. In order to ease the 
comparison between different setups and fitting parameters 
we always provide the results for the wide setup as reference 
in each figure. Furthermore, we list the ranges containing 
68% of the solutions along with the median in appendix [B] 



■^ without dust reddening 
^ see Appendix [Cl 



^ We summarise the results for fits with the BC03-setup to M05- 
based mock star- forming galaxies in Appendix [C] and vice versa 
in Appendix Id) 
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Figure 6. Best fit ages (y-axis) compared to input ages (x-axis) of mock star-forming galaxies for the wide and only-r setups at redshift 
0.5 and 2 (top and bottom panels, respectively). Symbols as in Fig.fS] Left: Comparison with the age of the oldest population. Middle: 
comparison with V-band luminosity-weighted ages. Right: Comparison with mass-weighted ages. 



4.1 Star-forming galaxies 

4.1.1 Age 

The comparison of fitted ages with intrinsic ages aUows us to 
identify which age (mass-weighted age, luminosity-weighted 
agqj, age of the oldest population) is actually recovered by 
the fit. As discussed in MIO, by definition, the age parame- 
ter t of the templates and the fitting denotes the age of the 
oldest population that is present, as it marks the starting 
of star formation (see MIO for details). However, as one fits 
the integrated luminosity the true age is difficult to extract, 
particularly for galaxies with extended star formation due 
to the overshining effect. For example, it is known that fit- 



° Ages are weighted by the V-band luminosity of the correspond- 
ing SSP models. We also consider the bolomctric luminosity. 
Overall, the latter provides younger ages in comparison to the 
V-band luminosity weighted ages, but the overall trends are sim- 
ilar. 



ting single-burst models to data give 'luminosity-weighted' 
quantities, which mostly reflect the properties of the most 
luminous generation. 

In Fig. [6] we show the comparison for the wide and only- 
T setups at redshift 0.5 and 2. We find that ages derived 
for dust-free spectra agree best with mass-weighted ages, 
particularly at high redshift. This is somewhat a new con- 
clusion, and suggests that the use of composite templates 
allows to obtain ages that are closer to the mass-weighted 
ones than to the luminosity-weighted ones. The youngest 
fitted ages (<~ 10* yr) are underestimated most. With a 
wide setup mass-weighted ages are recovered in 68% of the 
cases between -0.91 dex and 0.07 dex in the unreddened 
case (-2.83 dex and -0.37 dex in the reddened case) for old 
galaxies with little on-going star formation at low redshift. 
The median offsets between recovered and mass-weighted 
age are -0.35 dex and -1.78 dex in the unreddened and red- 
dened case, respectively. At higher redshift mass-weighted 
ages are recovered better (see Table IB2[) . At high redshift 
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fitted ages are similarly comparable to the oldest ages while 
most are underestimated for the oldest galaxies (at z=0.5). 
l/-band luminosity-weighted ages are predominantly overes- 
timated for objects with higher SFRs (at z=2) and predom- 
inantly underestimated for objects with little star formation 
(at z=0.5), the latter is due to the overshining. 
Since the range of available ages in the fitting narrows to- 
wards higher redshift, differences in best fit age between 
template setups and between recovered and true age de- 
crease towards higher redshift, particularly for ages ^ 10** 

yr- 

For the reddened case, the age-dust degeneracy is strong 
and causes ages to be underestimated (by up to nearly 3 
dex in the worst cases). The effect is less severe at higher 
redshift because of the smaller available age range. Here, 
only the youngest ages (below 10* yr) are underestimated. 
For young and dusty star-forming galaxies (those at high 
redshift) the reason is twofold. Firstly, a small contribution 
stems from the age-metallicity degeneracy. Template metal- 
licities are much higher than mock galaxy metallicities (com- 
pare Fig. [2]). Thus, bluer colours are reproduced by younger 
ages. 

Secondly and dominating, template SFHs do not match 
those of the semi-analytic galaxies resulting in the overesti- 
mation of some mass- weighted ages (at z = 2). 
For older galaxies with little on-going star formation (those 



at low redshift) and in the reddened case very young ages 
(< 10* yr) and a large amount of dust are obtained with the 
SSP component included in the wide template setup. The 
fit is clearly fooled by the overshining in combination with 
the age-dust degeneracy (see iRenzinil 120061 . for a detailed 
description of this degeneracy and its effects). Semi-analytic 
galaxies at z < 1 are hardly reddened due to their low SFRs 
(see section [21 • Allowing for dust reddening in the fit drives 
the ages to younger values since the addition of dust mimics 
redder colours (see also Fig. 1101 this effect is less strong for 
2 = 2 and 3 as the simulated galaxies are dust reddened). 
With the only-r setup most of these objects are fitted with 
significantly less dust, longer r, higher metallicity and older 
ages. 

Very young ages from the only-r setup are due to the SFH 
and metallicity mismatch. In general the age determination 
is better at high redshift (see Tables iBll and lB2l for median 
values and 68% ranges). Deviations at low redshift are large 
because of on-going star formation and the wide age range 
covered by the stellar populations in a single galaxy (Fig. 

In order to single out the effect of metallicity on the derived 
ages, we used mono-metallicity template setups, keeping the 
variety of star formation histories and IMF of the wide setup. 
Overall, the effect is small and therefore, it is not shown here. 
When reddening is switched off the effect is very clear: when 
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Figure 8. Ages derived from SED-fitting as a function of wavelength for redshift 0.5 and 2 in tlie case witliout dust reddening. The red 
histogram shows the distribution of the mass-weighted age. 



the template metallicity is high, sub-solar metallicity galax- 
ies are fitted with a younger age to compensate this mis- 
match. When reddening is introduced into the procedure, 
the age-dust degeneracy partly compensates metallicity ef- 
fects. Only when the metallicity is overestimated the most, 
the number of objects with a younger best fit age increases 
significantly, particularly for the metal poorest objects (at 
high redshift). 

We now investigate how the differences in the model-SED 
due to different IMFs affect the derived best fit ages. These 
differ substantially between the various IMFs (Fig.[7ll, espe- 
cially for a top-heavy IMF. Again, the case without red- 
dening helps understanding the effect. At 2 = 3 and 2 
top-heavy IMF models give systematically older ages than 
Salpeter IMF models. The excess of massive stars and thus 
UV light for very young ages of the top-heavy IMF models 
is compensated by on older age. The same is true at low- 
redshift where ages for a top-heavy IMF are systematically 
older than ages for a Salpeter IMF. Ages derived with a 
Chabrier and Kroupa IMF are much closer to those derived 
with Salpeter. At intermediate ages (i.e. in the intermedi- 
ate stellar mass range) the SED of a Kroupa and Chabrier 
IMF is slightly redder because of the effect on the evolu- 
tionary flux. This is compensated by younger ages. When 
reddening is included scatter increases, particularly at low 
redshift, due to the age-dust degeneracy. At high redshift 
ages obtained with top-heavy IMF templates are now much 
older than those of Salpeter IMF templates (and the fitted 
reddening is higher). 
Finally, we study the dependence of the derived ages on the 



adopted wavelength range in the fitting which is shown for 
the unreddened case in Fig. [8] for a few common filter sets. 
Since the emission of stellar populations of different ages 
peaks at different wavelengths one expects that a narrow 
wavelength range in the fit biases the result towards par- 
ticular ages. In the unreddened case the effects are small 
for the oldest galaxies (z=0.5). Excluding IRAC filters and 
using only optical wavelengths in the fitting makes derived 
ages somewhat older. For higher redshift and intrinsically 
younger objects the opposite is the case. The more the 
wavelength coverage is focussed towards the rest-frame UV, 
the younger the derived ages. Neglecting blue filter bands 
has a slightly rejuvenating effect for the oldest galaxies, for 
younger ones {z ~ 2) the effect is smaller because the rest- 
frame UV is excluded and underlying older stellar popula- 
tions are emphasised in the fit. The inclusion of reddening 
has little impact on the general dependence of derived ages 
on wavelength coverage and is therefore not shown. At low 
redshift, however, the age-dust degeneracy dominates. 
In summary, fitted ages compare best to mass- weighted ages 
and not to luminosity-weighted ages, as widely believed. 
The difficulty in recovering the oldest age remains which 
is mostly related to mismatches in the star formation his- 
tory. In conclusion and perhaps not surprisingly, SSPs or 
r-models with low r's should not be used when dealing with 
star-forming objects. 

Since using mono-metallicity template setups has a very 
small effect on the derived ages compared to using a wide 
set of metallicities because age effects dominate over metal- 
licity effects in star-forming galaxies, it is sufficient to use 



© 0000 RAS, MNRAS 000, 000-000 



12 Pforr, Maraston & Tonini 






'o 



UBVRIJHK+IRAC 
z=0.5, wide 

® ® occocwosaiKoo 



UBVRIJHK+IRAC 
z=2, wide 

e •oeo®o 




metallicity|^»^=j as [Z/H] 

Figure 9. Comparison between recovered and mass-weighted 
metallicity of mock star-forming galaxies at redshift 0.5 and 2 
for the wide setup. Black and red symbols refer to the cases with- 
out and with dust reddening, respectively. Template metallicities 
are -0.7, -0.33, and 0.35 (from a fifth solar metallicity to twice 
solar metallicity). The line of equality is shown. 



mono-metallicity template setups for star-forming galaxies. 
The choice of IMF affects ages such that the wrong IMF can 
be compensated by a different age. The largest effects are 
seen for the IMF that is most discrepant (top-heavy IMF 
here) to the input IMF. 

The wavelength coverage plays a crucial role in the determi- 
nation of ages such that a restriction in wavelength coverage 
results in younger ages at high redshift. At low redshift ages 
are underestimated due to the age-dust degeneracy and the 
wrong SFH. Because of the mismatched SFH the derived age 
resembles the real one, e.g. the oldest population present, in 
hardly any case. 

Median offsets between recovered and mass-weighted ages 
for various fitting setups are listed in Tables IBll and IB2I 



4.1.2 Metallicity 

We know that most template metallicities are too high com- 
pared to input metallicities. Consequently, metallicity is 
overestimated for most objects, particularly at high redshift 
(Fig. I9J. This is the case even when reddening is switched 
off and demonstrates the age-metallicity degeneracy. The 
highest metallicity templates of the wide setup are mainly 
chosen when reddening is allowed in the fit because of the 
added degeneracy with dust. These are best fit with a young 
and short starburst (SSPs or small values of r) and larger 
amounts of reddening. In a statistic sense the fit is sensitive 
to the metallicity, such that the number of best fit templates 
with low metallicities is higher at high redshift. The oppo- 
site is the case for high metallicities. 

The derived metallicities depend only little on the wave- 
length coverage, therefore we do not show it. The lack in 
wavelength coverage affects the most when only optical fil- 
ter bands are used such that most galaxies are best fitted 
with the highest metallicity templates. 

Overall, for star-forming galaxies, metallicities are poorly 
recovered because ages are poorly recovered, i.e. ages are 
underestimated and metallicities are overestimated, which 
in turn is driven by the poor match of the SFH and the 
overshining effect. 



4.1.3 E(B-V) 

We present the comparison between derived and input red- 
dening for the wide setup and illustrate the age-dust degen- 
eracy in Fig. 1101 The reddening is very well recovered at 
2: ^ 2 for the wide and inverted-r setups. Median offsets 
between recovered and true E(B-V) are and 0.01, respec- 
tively. 68% of the solutions have offsets between ~ and 
~ 0.1 (Table [B2)) . The only-r setup provides slightly larger 
reddening values. This is the same result that MIO found 
for real z ~ 2 star- forming galaxies. They showed that dust 
reddening obtained from a fit with inverted-r models agrees 
best with those derived from the rest-frame UV (compare 
their Figs. 16 and 20). 

At 2: < 2 on the other hand reddening is heavily overesti- 
mated for all setups, e.g. the median offset for a wide setup 
is 0.29 and 68% of the solutions lie within /^E{B - V) = 
and 0.59 (see Table iBl) . Accordingly, we find that fitted 
ages are younger at low redshift in the case with reddening 
compared to the case without reddening. When reddening is 
overestimated by E{B — V)^ 0.4 — 1.2 ages are lower by up 
to ~ 3 dex at redshift 0.5. This difference decreases towards 
redshift 3 where ages are also older than in the unreddened 
case (lower panel in FigllOp. 

Because the ability to recover reddening depends only little 
on the wavelength coverage, we only summarise the main 
points: 1) the exclusion of near-IR filter bands has almost no 
effect because dust reddening absorbs light predominantly at 
rest-frame UV wavelengths. 2) For that reason the exclusion 
of the bluest filter bands has the largest effect on the derived 
reddening, such that reddening is even more overestimated 
at low redshift and underestimated at high redshift. 
Compared to the full age grid shown in Fig. [10] a cap on 
minimum age reduces the overestimation in E(B-V) from 
maximal A_B(B - V) ^ 1.1 to 0.45 at z = 0.5 for all tem- 
plate setups. The median offset is improved by 0.15. The 
maximum age difference is now ^ 2 dex at the lowest red- 
shifts. At higher redshift the minimum age restriction im- 
proves the derived reddening only little and ages differ less 
between cases with and without reddening. Rebinning of the 
age grid does not improve the recovery of reddening at any 
redshift. 

Metallicity has only a small effect on the derived reddening, 
hence, we do not show it. E(B-V) is slightly more overesti- 
mated at high redshift for the highest metallicity template 
setup. Overall, the scatter is increased. 

In summary, reddening for both inverted-r and wide setups 
is well recovered for galaxies that are dust reddened; E(B-V) 
is recovered in 68% of the cases within —0.01 and ^ 0.10. 
This depends little on wavelength coverage, metallicity and 
age. For intrinsically dust-free galaxies (those at z=0.5), red- 
dening is heavily overestimated (68% within Ai?(_B — V^) = 
and 0.59) and ages are underestimated because of the age- 
dust degeneracy. 



4.1.4 Stellar Mass 

The choice of template setup also affects the mass recovery 
(Fig.lTTjI. With the wide template setup the mass recovery at 
redshift 2 and 3 is very good, particularly for masses between 
10^" and \Q^^ Mq. Masses are underestimated for lower-mass 
galaxies. These are small disks with increasing SFHs like the 
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Figure 10. Upper panel: Difference between derived and input E(B-V) as a function of redshift (0.5, 1, 2 and 3, respectively, from left 
to right) for the wide setup (black open circles) and for the inverted-T model (red dots, redshift 2 and 3). Lower panel: Difference in 
derived aees with and without internal reddening as a function of fitted reddening. 



ones shown in panel a) of Fig. [3] left-hand side. Indeed, their 
mass can be perfectly recovered using the inverted-T models 
(bottom panel in Fig. 1111 see below). Including dust affects 
the result only little. Median offsets between true and recov- 
ered mass are ~ —0.1 dex at z=2 and 68% of the solutions 
show offsets between ~ —0.3 dex to dex in the unred- 
dened case and ~ —0.5 dex to ~ dex when reddening is 
included. For older galaxies with little on-going star forma- 
tion (z=0.5) on the other hand, masses are underestimated. 
For these the median offsets with and without reddening in 
the fit are ~ —0.6 dex and ~ —0.3 dex, respectively. 68% of 
objects show offsets in stellar mass between ~ —0.5 dex to 
~ dex in the unreddened case and ~ —1.4 dex to ~ —0.2 
dex in the reddened case. When dust is included, this effect 
is mostly due to the age-dust degeneracy, whereas without 
reddening a different set of degeneracies dominates. Firstly, 
no available SFH provides a real match to the mock SFHs. 
As seen in Fig. [3] the typical SFH at low redshift is bimodal 
in the sense that star formation decreases after an initial pe- 
riod of increasing star formation. On-going star formation 
causes overshining, thus hiding most of the old underlying 
populations. This has the largest effect at redshift 0.5 be- 
cause the age distribution of the stellar populations building 
up a particular galaxy is much wider (see Fig. [6]). Secondly, 



template metallicities are discrete and for the bulk of stel- 
lar populations of about half the objects still too metal rich 
(compare Fig. [2] and Fig. [9]). The blue colour of a galaxy 
is matched by a template of low metallicity in combination 
with a longer star formation time scale and an older age, or 
a higher metallicity, shorter star formation time scale and 
a younger age. The latter leads to underestimate the mass. 
SSPs are the main cause for very young ages and conse- 
quently very low stellar masses in the wide template setup. 
Masses are underestimated by up to 0.8 dex. 
It is important to note that - if instead of picking up the best 
fit for those objects for which the mass is underestimated, 
one would average over neighbouring solutions (in the range 
of X^ + 1) " their mass estimate could be improved by up 
to 0.4 dex. Although masses are significantly improved for 
some of these objects, carrying out the procedure for all ob- 
jects only results in an average improvement of 0.06 dex in 
the unreddened case and 0.18 dex when dust reddening is 
included. 

An only-r setup can recover stellar masses at high redshift 
similarly well in the high-mass range, and due to the differ- 
ent star formation history, similarly badly in the low-mass 
range. For older galaxies with little on-going star formation 
at low redshift masses are underestimated. However, this 
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Figure 11. Stellar mass recovery at redshift 0.5 and 2 as a function of template setup, namely from top to bottom: wide, only-T, 
inverted-T. Red circles refer to cases with reddening, black dots to no reddening. Quality factors are given for the entire mass range for 
reddened and unreddened cases, respectively. Green triangles show the mass estimate for the youngest objects at z=0.5 with increasing 
SFH (see Fig. (Sjl when an inverted-r model of age 1.1 Gyr is used in the fitting. 



setup including reddening performs slightly better than the 
wide setup. 

At all redshifts masses are better recovered when the mock 
star-forming galaxies are unreddened, because there is no 
age-dust degeneracy (Fig. Ilip . At z = 0.5 masses are re- 
covered within -0.8 dex and -fO.3 dex for all template se- 
tups. When reddening is included the SED-recovered stellar 
masses are underestimated by up to ~1.6 dex for the least 
dust reddened and oldest galaxies in our sample. This stems 
from underestimating the age due to the age-dust degener- 
acy (Fig.©. 

The mass recovery improves with overall decreasing galaxy 
ages and age spread, for all setups which perform in a very 
similar way for z^ 1. Masses larger than 10^'^ Mq axe very 
well recovered for all template setups. Masses lower than 
10^-5 Mq are underestimated by ~ —0.7 dex at 2 = 3. 
Here, it is important to note that the mock galajcies have 
SFR of at most SOMo/yr (see Fig. 2}, whereas real galax- 



ies c an have higher star formation rates (e.g. iDaddi et al.l 
[2007]). Hence, these conclusions may not be extended to real 
galaxies, and indeed in MIO we showed that for z ~ 2 star- 
forming galaxies with higher SFRs, an only-r setup underes- 
timates the age, hence underestimates the stellar mass and 
overestimates the star formation rate. In MIO we have also 
verified that an inverted-r-type star formation history with 
high formation redshift is best for the derivation of physi- 
cal properties such as reddening and SFR and for the mock 
galaxies of this work for stellar mass (their Fig. 24). Here 
we reinforce the result by a starring mass recovery for mock 
galaxies at redshift 2 and 3 (Fig. [TT] bottom right) with 
inverted-r models. 

Obviously, the same models with the same priors do not al- 
low a good mass recovery for older galaxies at lower redshifts 
because they do not trace their correct SFH. However, if we 
change priors and use inverted-r models with formation red- 
shift (i.e. age) closer to the age of the oldest population, thus 
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Figure 12. Median recovered stellar mass of mock star-forming galaxies for mono-metallicity wide setups as a function of redshift, hence 
galaxy age (upper left to lower right). Metallicity in each panel increases from left to right. Red and black triangles refer to cases with 
and without reddening, respectively. Error bars reflect 68% confidence levels. 



simulating objects like the one shown in the bottom-left of 
Fig. [3l the stellar mass is recovered very well (green trian- 
gles in Fig. Ilip . However, the SFHs of real galaxies are not 
known and a distinction between objects of increasing and 
decreasing SFR cannot be made prior to the fitting. A fo- 
cussed investigation on these models and on how to apply 
them to different redshifts and types of galaxies is postponed 
to a dedicated paper. 

Nowadays, in order to simulate more realistic star forma- 
tion histories, several authors adopt a two-component tem- 
plate, such that a second burst at an arbitrary time is super- 
imposed on an underlying exponentia l ly declining star for- 



mation liistorv (jPapovich et al.ll200ll: iGallazzi et alj 
Salim et all 1200?: 'Pozzetti et all 120071:1 



Walcher et al.1 



2005 



2008 



Muzzin et ai] :2009: Noll ct al. 2 0091'). In this w ay, the stellar 



mass can be maximised. IPozzetti et al.l (|2007|) report an up 
to 40% increase in stellar mass using two-component mod- 
els over models with simpler SFHs. This composite model 
maybe a much better choice at low redshifts. 



In Fig. [12] we show the mass recovery as a function of the 
metallicity adopted in the templates. For star-forming galax- 
ies in the dust free case the lowest metallicity setup performs 
best at each redshift. The true mass is still underestimated 
by a median of 0.3 - 0.1 dex. The mass recovery becomes 
worse with increasing metallicity because metallicities are 
too high compared to the mock (see Fig.[2ll. This is distorted 
when reddening is included and masses are now underesti- 
mated by a median 0.6 - . 1 dex . Overall, as already con- 
cluded by iBolzonella et al.l (|2009l ), the differences between 
different metallicity setups are small. Consequently, fitting 
with a mono-metallicity template setup is sufficient for star- 
forming objects even when the metallicity is wrong. 
Since the age-dust degeneracy is the main reason for under- 
estimated ages which in turn cause underestimated masses 
we quantify the effect of putting a prior on the minimum 
age of templates in the wide setup or choosing a coarser 
age grid. An age restriction is often applie d in the literature 
IBolzonella et al.ll2009l : IWuvts et al.ll2009l ') and we have also 
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Figure 14. Stellar masses of mock star-forming galaxies derived using templates with IMFs different from the input Salpeter IMF. Red 
marks solutions obtained with reddening, black the unreddened case. Green solid lines represent fits to the unreddened case, magenta 
dashed lines fits to the reddened case. There is little difference between them (see also tables [3l and [4ll . For obtaining scaling relations 
we used the entire merger tree for the linear fits. The coefficients of the fits are provided in section [6l 



explored its effect in MIO, though there we could conclude 
that this trick is not sufficient to ensure a correct derivation 
of galaxy properties and - especially - does not allow to make 
progress in understanding galaxy evolution. We found that 
among the explored age grids, a minimum age of 0.1 Gyr im- 
proves the mass estimate the most (Fig. WiyV The resulting 
regulation of both the overshining and the age-dust degen- 
eracy is particularly effective for the oldest galaxies (at low 
redshift) and in the reddened case. Here, the improvement 
is 0.4 dex on average, but can be much larger for single ob- 
jects. Especially, at z > 1 stellar masses are remarkably well 



recovered with this artificial age constraint. Hence, if one is 
merely interested in estimating the stellar mass setting an 
artificial age constraint appears to be a good idea. We found 
that switching off reddening and applying a minimum age 
cut has similar effects. 

The difference in the stellar mass recovery when one uses an 
incorrect IMF is small (Fig. fTl)) . Cases for different IMFs 
differ the most at z — 0.5 for the reddened case - a clear ef- 
fect of the age-dust degeneracy. A top-heavy IMF - the most 
discrepant to a Salpeter IMF - fails to reproduce the correct 
stellar mass (Fig. llSp . At each redshift we find a large scat- 
ter around the true masses. 



"^ We have also explored larger minimum ages - 0.5, 1 and 5 Gyr 
- for z=0.5 star-forming galaxies and found that these have a 
negative effect on the mass estimate. 
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Figure 15. Median stellar mass recovery as a function of redshift and IMF (wide template setup and UBVRIJHK+IRAC wavelength 
coverage for each). 68% confidence levels are shown as error bars. Redshift increases from the top left to the bottom right. Red and black 
mark solutions obtained with and without reddening, respectively. 



However, there is no clear ofTset between these IMF^ (even 
without dust) for galaxies with little star formation, unlike 
commonly assumed in the literature. This demonstrates that 
other free parameters in the fitting, such as age and SFH are 
able to compensate for the wrong IMF. F or young galaxies 
with high star formation an offset exists. iBolzonella et al.l 
((20091) state statistical differences of stellar mass between 
the various IMFs as log M^^jp^t^^ ~ log M*+y with y — 0.23 
and 0.19 for Chabrier and Kroupa IMF (x), respectively, but 
acknowledge that the best fit value for other parameters can 
significantly differ. We list our scaling relations in section [6l 
Although Figs . [Til and [TSl show that the choice of the IMF is 
felt in the fit and the correct IMF recovers masses best, the 
right IMF cannot be identified by choosing the fit with the 
minimum xt among all solutions. In fact, very often even 
a top-heavy IMF is picked. This confirms again that the 



For verification we used the entire merger tree, but in Fig. 1151 
only show the sample of 100 galaxies. 



minimum xt does not necessarily provide the best physical 
solution and confirms the notion that picking up the IMF 
from SED-fits is virtually impossible. 

We focus now on the effect of the assumed wavelength cov- 
erage in the fitting, on stellar masses for the wide setup at 
each redshift and the inverted-r models at redshift 2 (Figs. 
[16] and [nil. 

The dependence on wavelength coverage is weak for old 
galaxies with little on-going star formation at z=0.5 - masses 
are similarly underestimated for each filter setuju. This is 
obviously driven by the overshining and the wrong SFH 
which the filter setup cannot help rectifying. For 2 > 1, 
where galaxies are younger and the SFR is higher, we 
find that the broader the wavelength coverage, the bet- 
ter the mass recovery, particularly when reddening is in- 



^ Note that excluding SSPs in the fitting setup improves the mass 
estimate in the reddened case by ~ 0.2 dex for the various filter 
setups. In the unreddened case the improvement is not significant. 
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Figure 16. Stellar mass recovery (median) of star-forming galaxies as a function of wavelength coverage and filter set at redshifts 0.5, 1, 
2 and 3, for the wide template setup. The filter setup is varied from left to right in each panel, redshift increases from top left to bottom 
right. Red represents reddened solutions, black unreddened ones. 68% confidence levels for each setup are shown. 



volved. When red filter bands are excluded in the fit, 
masses are underestimated by up to 2 dex. The cover- 
age of the rest-frame near-IR appears to be crucial - 
as concluded in M06. Th e sam e conclusion was r eache d 
by iKannappan fc GawiseJ (120071'): iBolzonella et al.l (|2009|): 
[Lee et al.' ('2009^ and lllbert et al.l (|2010l '). IShaplev et all 



(2005) van dcr Wei et al 



_ _ _ , , (|2006h conclude the opposite. 

iMuzzin et al.l ((20091) merely find an improvement in the con- 
fidence levels for the derived parameters when including the 

rest-frame nea r -IR. 

iPozzetti etai] (|2007l ) (for z > 1) and IWalcher et al.l (|2008l ) 
(for z < 1.2) find that the lack of near-IR filter bands 
(observed-frame) in the fit leads to larger stellar masses com- 
pared to estimates which included them. We find a decrease 
instead. We assign this different behaviour to the differences 
in stellar population model and SFHs used in the fitting 
setup. 

For z ^ 2 we find that excluding filter bands covering the 
rest-frame UV allows one to recover stellar masses on aver- 



age equally as well as the full filter set. Whereas the near-IR 
contains information from older, less luminous stellar pop- 
ulations and allows us to get a robust estimate on stellar 
mass, the UV contains mostly information about younger, 
more recently formed stellar populations. Hence, for a good 
stellar mass recovery, the bluest filter bands are less impor- 
tant and may even be damaging. Clearly, a filter setup that 
only covers the rest-frame UV is useless for the mass recov- 
ery of high redshift galaxies. Remarkably, the combination 
of optical (BRI) and only one near-IR filter (K) recovers 
masses reasonably well. This could be useful for economis- 
ing the request of telescope time. 

The inclusion of a further filter band ( Y with X^ff ~ 
lOOOOA) does not improve the stellar mass estimate inde- 
pendently of redshift and reddening. In general, the mass 
recovery at redshifts z ^ 1 is extremely good, provided one 
uses a filter set that covers the rest frame near-IR because 
this helps to pick the right star formation history (see M05) . 
In MIO we showed that inverted-r models recover the stellar 
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Figure 17. Median differences between true and recovered stellar mass as function of wavelength coverage and filter set for the inverted-r 
templates at redshift 2. Blue and green symbols refer to the mass recovery obtained with the wide setup without and with reddening, 
respectively. 



masses of high redshift star-forming galaxies best. We show 
here (Fig. I17p that this is robust against variations in fil- 
ter setup because this model is less dependent on age. Only 
for the most restricted wavelength ranges and those with- 
out rest-frame near-IR a significant scatter around the true 
masses is found. 

As a summary of this whole section, the stellar masses of old 
galaxies with little residual star formation are usually un- 
derestimated because ages are underestimated which in turn 
is due to a mismatch in SFH between template and galaxy 
and the overshining effect. This effect is worse when dust 
reddening is included because of the age-dust degeneracy. 
This depends only very little on metallicity and wavelength 
coverage. An artificial age constraint improves the mass es- 
timation. 

The masses of young galaxies with high star formation are 
slightly underestimated when age is a free parameter in the 
fit. Masses are perfectly recovered with inverted-r templates 
with proper priors on formation epochs and r, as shown in 
MIO. Dust reddening and metallicity affect the whole result 
very little. Masses are increasingly underestimated for nar- 
rower wavelength coverages when the wide setup is used. 
Here, the rest-frame near-IR is crucial for a robust mass es- 
timate. Masses derived using inverted-r models are less sen- 



sitive to wavelength coverage. Overall, a wide wavelength 
coverage also implies a smaller scatter. 

The SED-fit and the mass estimate is sensitive to the choice 
of IMF of the fitting template, but the correct IMF cannot 
be identified by means of the overall minimum xt- 
Ranges including 68% of the solutions for the stellar mass 
recovery are listed in Tables [Bll and [B2] 



4-1-5 Star formation history and star formation rate 

Fig. [H] shows that - similarly to stellar masses - SFRs agree 
very well with input SFRs only when reddening is switched 
off (median A log SFR f» 0, see Tables IbTI and |B2]) . This 
is nearly independent of template setup. When reddening is 
involved, however, some SFRs are drastically overestimated 
at 2: ^ 1. As was shown in MIO, such very high star forma- 
tion rates in combination with low stellar masses are a pure 
artifact of SED-fitting. The inverted-r setup allows for the 
best estimates. The wide setup reproduces the SFR best at 
z = 1 but this may just be due to the particular composition 
of the mock galaxies at these redshifts. At redshift 3 for the 
wide and only-r setup SFRs are also underestimated at the 
high SFR end because of SFH mismatches. Low SFRs are 
overestimated for the same reason. The inverted-r setup can 
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Figure 18. The recovery of SFR as a function of template setup, namely wide, only-r and inverted-r from top to bottom. Redsliift 
increases from top left to bottom right. Red circles refer to cases with reddening, black dots to no reddening. Quality factors are given 
for the entire SFR range for reddened and unreddened case, respectively. Green triangles show the SFR estimate for the youngest objects 
at z=0.5 with increasing SFH when an inverted-r model of age 1.1 Gyr is used in the fitting. These are the same objects as in Fig. 1111 
Objects with SFR=0 arc not shown. SFR=0 indicates that the best fit solution is an SSP or a truncated SF model with an age larger 
than the truncation time. 



recover the SFRs well for objects with increasing SFH. In 
MIO we showed that SFRs derived from inverted— r models 
are in excellent agreement with those derived with the UV- 
corrected-slope method for high-redshift star-forming galax- 
ies in the GOODS-S sample (see Fig. 19 in MIO). Note that 
in MIO we used only the Calzetti reddening law in the fit 
while in this paper we show SFRs that are derived with all 
reddening laws which results in small differences in SFR for 
some objects. 

The accuracy with which one estimates SFRs also profits 
from the constraint in age and the limited age-dust degener- 



acy. Particularly, low star formation rates are overestimated 
less at z = 0.5 (Fig. I19|) . Only allowing ages larger than 
0.1 Gyr gives the best result, although some degree of over- 
estimation is still present. Simply rebinning the age grid, 
however, does not work, SFRs are even more overestimated. 
At higher redshift the improvement is less than at z — 0.5. 
Again a rebinned age grid performs worse and leads to un- 
derestimation of the highest SFRs. 

The effect on stellar mass and SFR caused by the age con- 
straint is the same for each template setup (excluding the 
inverted-r setup because its age is fixed). 
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Figure 20. SFR recovery as a function of template IMF at rcdshift 2. Colours are the same as in Fig. 1321 SFR recovery for a Salpeter 
IMF is shown in Fig. 1181 SFR=0 cases are not shown in this plot. 



SFR estimates using templates with a Kroupa or a Chabrier 
IMF instead of a Salpeter IMF show qualitatively the same 
behaviour (in the case with reddening). In the unreddened 
case SFRs are predicted correctly from fits with these three 
IMFs. In Fig. [20] we show the star formation rates derived 
with a Salpeter, Kroupa, Chabrier and top-heavy IMF for 
redshift 2 galaxies as an example. SFRs for a top-heavy IMF 
are generally lower than SFRs for a Salpeter IMF and also 
lower than the true value. This leads to a clear offset in the 
no reddening case, independently of redshift. When redden- 
ing is included, SFRs increase as the age decreases. 
Fig. [21] demonstrates the effect of wavelength coverage (or 
lack thereof) on the determination of the star formation rate. 
SFRs are generally well recovered in the unreddened case 
partly because of compensating effects. Deviations from this 
are due to a lack in wavelength coverage (RIJHK-I-IRAC at 
z = 3). In the reddened case SFRs are overestimated at al- 
most all redshifts. While the mass recovery is nearly insensi- 
tive to the neglection of blue filter bands, the SFR estimate 
is hampered. This is not surprising since the most recently 
formed stars dominate the light emission in the UV which 
therefore plays an important role in the estimation of SFRs. 
At 2 = 3 SFRs are also underestimated at the high end. 
In the reddened case, SFRs are best recovered with filter se- 
tups that cover the broadest wavelength range, while in the 
unreddened case a dependence with wavelength coverage is 
negligible. 

We show the derived SFRs from inverted-r models in Fig. 1221 
for a selection of filter setups. Similarly to the mass deriva- 
tion, the SFR estimates do not differ much with filter setup 
and wavelength coverage. SFRs are very well reproduced in 
the unreddened case. In the reddened case, SFRs are overes- 
timated for all filter setups when SFHs between mock galaxy 
and template do not match. 

As already concluded in MIO, the SFR determination is 
driven by the correct SFH. The good SFR recovery in the 
un-reddened case a,t z — 0.5 despite the mismatch in SFH is 



due to compensating effects with underestimated ages and 
stellar masses. This again reinforces our conclusion, that the 
simultaneous recovery of mass, age, SFR, etc. for galaxies 
from SED-fitting is not possible without knowing the exact 
star formation history. 



4.2 Passive galaxies 

4.2.1 Age 

Fig. [23] shows the age recovery of mock passive galaxies as a 
function of template setup for wide, only-r, only SSPs (but 
all four metallicities) and only solar SSP template setups. 
Overall, the age determination is much better than was the 
case for star- forming galaxies (Fig. [6]). A wider range of star 
formation histories and metallicities in the fitting leads to 
generally overestimated ages in the unreddened case at red- 
shifts 0.5 and 1 with the mismatched SFH as the biggest 
driving factor. For the wide setup at z=Q.5 ages are recov- 
ered within ~ dex to ~ 0.15 dex in 68% of the cases when 
reddening is switched off (see Tables lB3l and |B4)) . Overall, 
the age recovery improves with templates with the correct 
metallicity (i.e. only-r and solar SSP). Additionally, match- 
ing the SFH correctly recovers the age best. Small deviations 
from the true age still occur when fitting with the correct 
template for two reasons: 1) a slight mismatch in age be- 
tween template age grid and galaxy agq^j and 2) photomet- 
ric uncertainties. 

Since the choice of SFH is limited in the only-r setup and 
the metallicity is the correct one, short r's which resemble a 
SSP closest are chosen. The longer period of star formation 
is then compensated by an older age. Within the wide setup 
a variety of SFHs exists besides the correct solution of a 
solar metallicity SSP. Both r and truncated models closely 

^^ For ages of 1 and 1.5 Gyr the closest matching age of the 
template age grids are 1.015 Gyr and 1.434 Gyr. 
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Figure 21. SFR recovery as a function of wavelength coverage and rcdshift (increasing from top left to bottom right). The filter setup 
is varied from top to bottom in each panel. Objects with SFR=0 are best fit with a SSP or a truncated model with age larger than the 
truncation time and are not shown in this plot. Red circles refer to the reddened case, black dots to the unreddened case. 



resemble a SSP as long as the timescale r for star formation 
or truncation time t are short enough compared to the age. 
The differences between them are smaller than the imposed 
photometric uncertainties. Furthermore, a wider variety of 
metallicities is available. Thus, it is not surprising that the 
age-metallicity degeneracy and a degeneracy between age 
and SFH cause the average recovered age to be slightly older 
than for the only-r setup and that the dispersion is larger. 
The median offset in age for the wide setup is maximally 



- 0.04 dex. 

Comparing results for the only-SSP and solar-SSP setups - 
thus using the correct SFH - enables us to isolate the effect 
of the age-metallicity degeneracy. This is small on the me- 
dian, but the scatter induced by using wrong metallicities is 
much larger (Tables [B3l and [Bi|l . 

Similarly to the mock star-forming galaxies, including red- 
dening in the fit has the largest effect for the oldest galaxies 
(z=0.5) due to the age-dust degeneracy. But overall, the ef- 
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Figure 23. Median recovery of age for mock passive galaxies as a function of redshift and template setup. Black and red symbols refer 
to the cases without and with reddening, respectively. 68% confidence levels are shown. 



feet of reddening is small. 

Another way to decouple metallicity from SFH effects is to 
use mono-metallicity wide setups for which we show the re- 
sults in Fig. [211 Obviously, ages derived with a solar metal- 
licity setup are best recovered. The remaining scatter of 
0.05 — 0.1 dex stems from the SFH mismatch. Template 
setups with sub-solar metallicity overestimate the age to 
compensate their bluer colour. For the lowest metallicity, 
every galaxy is fit with the maximum age possible at the 
given redshift (or close to that age) in order to compensate 
the underestimated metallicity. In addition, all xt are larger 
than 2, most are larger than 10, and fits are getting worse 
towards higher redshift. Clearly, neither age nor SFH can 
compensate such a large discrepancy in metallicity. The sit- 
uation is reversed for super-solar metallicity templates. At 
low redshift the scatter is largest because the covered age 
range is wider. Ages around 1 Gyr are very well defined 
by the TP-AGB emission, older ages are more difficult to 
distinguish because of little evolution in the colours during 
the RGB-dominated epoch. The introduction of reddening 



has the largest impact at low redshift. Particularly at z=l 
the age-dust degeneracy partly compensates and reverses 
the effect of the age-metallicity degeneracy and causes an 
underestimation in age. At high redshift, reddening has no 
impact. 

Most importantly Fig. [24] shows that even when using all 
metallicities ages and metallicities are very well recovered. 
This means that when we do not know the metallicity it is 
better to use a wide range for it, as concluded in MOB. 
Mismatches in the IMF have rather small effects for the 
mock passive galaxies which is why we do not show it. Ages 
derived with different IMFs are similarly well recovered and 
do not show an offset between each other because differences 
in SED-shape due to the IMF are small for intermediate- 
mass stars, hence intermediate and old ages. The scatter 
increases for the oldest galaxies. The effect of reddening is 
negligible. 

We investigate the wavelength dependence of the derived 
ages using a wide setup in Fig.[5S| Median ages are recovered 
within ±0.1 dex in the unreddened case with little depen- 
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Figure 24. Median deviation of fitted age from true age of mock passive galaxies as a function of redshift and metallicity. Black dots and 
68% confidence levels refer to the unreddened case, red to the case that includes reddening in the fit. Note the change in scale between 
redshift 0.5 and 1 and redshift 2 and 3. 



dence on the wavelength coverage at low redshift. At higher 
redshift filter setups that do not cover the rest-frame near-IR 
and the red part of the optical generate a larger scatter. As 
already mentioned, the mock passive galaxies at these red- 
shifts are 1-2 Gyr old, the age range in which the near-IR 
is dominated by the TP-AGB emission. Without this crucial 
information, the fit is more prone to degeneracies between 
age, metallicity and SFH. Interestingly, a filter setup con- 
taining only four filter bands but including the near-IR and 
thus sampling the SED shape over a wider wavelength range, 
performs in most cases equally as good as other filter sets. 
The lack of rest-frame UV data at z=2 and 3, causes ages 
to be overestimated because of star formation history mis- 
matches. 

The inclusion of reddening in the fit has its biggest impact 
at low redshift. Here the age-dust-degeneracy causes a large 
scatter and an on average underestimated age for shorter 
wavelength coverages. Particularly, the large scatter shows 
that the age recovery fails when the information of the blue 



and reddest filter bands is missing. Overall, ages are best de- 
rived with the full wavelength coverage. Compared to star- 
forming galaxies, ages of passive galaxies can be better de- 
termined in the fit due to a well-defined 4000 A break and 
the absence of multiple generations. 

In summary, mismatches in SFHs are contributing the most 
to an offset in the derived ages. Thus the best SFHs for 
passive galaxies are SSPs and exponentially declining SFRs. 
As metallicity can be recovered (see also section l4.2.2|l . SED 
templates should cover a range in metallicities. Ages are best 
determined using the full wavelength coverage. Note that a 
ugriz filter setup at z ~ 0.5 performs well, albeit with larger 
scatter, which is mostly due to the low number of objects. 
This is an important finding as we are obtaining galaxy 
properties (stellar masses in particular) of BOSS (Baryon 
Oscillation Spectroscopic Survey) galaxies using this setup 
(Maraston et al. in prep.). We list median offsets between 
recovered and true age as well as 68% ranges in Tables IB3I 
andlB4l 
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Figure 25. Median deviation of fitted age from true ages of mock passive galaxies as a function of wavelength coverage at redshifts from 
0.5 to 3 using the wide setup. Black dots and 68% confidence levels refer to the unreddened case, red to the case that includes reddening 
in the fit. 



4.2.2 Metallicity 

For nearly passive objects at z ~ 2, M06 found that metal- 
licity plays an important role in allowing a robust determi- 
nation of the stellar population parameters. We confirm this 
result in section 14.2.11 Although a wide choice of metallici- 
ties is provided, the fit closely recovers the true metallicity 
for most objects (Fig. [26}. When reddening is introduced, 
the age-dust-metallicity degeneracy affects mostly the oldest 
galaxies (at low redshift). 

Derived metallicities depend on the wavelength coverage in 
the fitting such that at low redshift a lack of rest-frame near- 
IR and red optical coverage causes metallicities to be over- 
estimated in more cases, clearly, by compensating underes- 
timated ages. The rest-frame near-IR is crucial in breaking 
the age-metallicity degeneracy as already concluded in M06. 
When blue filter bands are excluded metallicities are under- 
estimated for a few more objects. These effects are smaller 
at higher redshift. Including reddening results in underesti- 
mation of metallicity for more objects when the wavelength 



coverage is restricted. Again the effect is largest at low red- 
shift. 

In conclusion, for nearly passive and aged galaxies it is im- 
portant to fit with a wide choice of metallicities because 
metallicity can be recovered. The best template setup for 
metallicity recovery is the wide setup. The broadest wave- 
length coverage results in the best metallicity estimates. 



4.2.3 E(B-V) 

Since we did not add reddening to the passive galaxies, any 
reddening resulting from the fitting procedure is overesti- 
mated. But generally, the reddening is correctly identified 
for most objects (Fig.[23 Tables |B3] and HU. At z=0.5 68% 
of recovered reddening show offsets between and 0.07 from 
the true reddening. Independently of template setup, the 
largest effect is found at low redshift where the age-dust de- 
generacy is more efficient as the allowed age range is larger. 
More objects are wrongly identified to be reddened as wave- 
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Figure 13. Average recovered stellar mass in given input mass 
bin (binsize 0.5 dex) for the wide setup with a minimum age of 0.1 
Gyr and a rebinned age grid. Red points and lines represent the 
reddened case, black stands for the unreddened case, blue thin 
lines show recovered masses obtained with the full age grid in- 
cluding reddening, green thin lines refer to a setup with rebinned 
age grid and reddening. Errorbars are one standard deviation. 
Quality factors are given for the entire mass range. 



length coverage decreases (in both red and blue wavelength) 
but the effect is small. 



Figure 19. Averaged recovered SFRs (y-axis) compared to 
known binned SFRs (x-axis) as a function of age grid. Repre- 
sentatively, we show results for redshift 0.5 and 2 only. Red dots 
and lines represent solutions with reddening, black ones those 
without reddening, both when the minimum age is 0.1 Gyr. Blue 
thin lines show the SFR recovery with a wide setup and redden- 
ing. Green lines refer to a wide setup with rebinned age grid and 
reddening in the fit. Binsizes vary from 2.5MQ/yr to 2OM0/yr 
from z = 0.5 to z = 3. Errorbars are one standard deviation. 
Quality factors are given for the entire SFR range for reddened 
and unreddened cases, respectively. 



4.2.4 Stellar Mass 

The mass recovery for the mock passive galaxies is very 
good (Fig. I28p and the median offset in mass is maximally 
± ~ 0.02 dex and 68% ranges are within ± ~ 0.10 for all 
template setups (see Tables IB3I and IB4|l . As usual, differ- 
ences between the reddened and unreddened case are small. 
Scatter is largest at low redshift. Deviations from the true 
stellar mass for the wide setup are due to degeneracies be- 
tween star formation histories, metallicity, age and dust 
(when included in the fit). Offsets in stellar mass even when 
using the correct template are caused by a slight mismatch 
in age between template and galaxy for the youngest ages 
and photometric uncertainties (see also sections 14.2.11 and 
14.51 respectively). When fitting with the solar metallicity 
SSP these small age mismatches cannot be compensated by 



a different choice of metallicity and SFH (or dust at high 
redshift). The contribution of dust increases the errors on 
the derived ages at low redshift because of the age-dust de- 
generacy. Clearly, in the other template setups the mass is 
corrected by the choice of an incorrect metallicity and/or 
SFH and, in the reddened case, dust. 

Metallicity effects are further explored in Fig. 1291 Stellar 
masses follow the same trend with metallicity as ages (com- 
pare to Fig. I24|l . For lower metallicities for which ages are 
overestimated, stellar masses are larger. For higher metallic- 
ities masses are smaller. In the unreddened case, masses de- 
rived from the lowest metallicity template are significantly 
underestimated as a direct consequence of the metallicity 
mismatch and its effect as described in section 14.2.11 This 
shortcoming in metallicity can only be partly compensated 
by age and not at all by SFH (which would just make the 
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Figure 28. Median mass recovery of mock passive galaxies for different template setups as a function of redshift. Red dots and 68% 
confidence levels refer to the inclusion of reddening in the fitting, black symbols to the unreddened case. Blue and green symbols refer 
to the mass recovery obtained with the wide setup for mock star-forming galaxies without and with reddening, respectively. 



SED even bluer). At z > 0.5 the fit with the lowest rtietal- 
licity setup fails completely (x^i^ > 50) and masses are not 
recovered. Adding dust improves the fits since some of the 
required red colour of the observed SED can be compen- 
sated by dust but ages are still overestimated. This results 
in overestimated masses. 

Stellar masses of mock passive galaxies are best determined 
with a template setup based on the same IMF used as input, 
namely a Salpeter IMF (Fig. I30|I^H . Although ages derived 
with a Kroupa and Chabrier IMF template setup are very 
similar to those for the Salpeter setup, stellar masses are 
systematically underestimated by ~ 0.1 — 0.2 dex. The stel- 
lar masses obtained with a top-heavy IMF template setup 
on the other hand are largely overestimated (by ~ 1 dex 
on average). These differences can be explained with the 

^^ The inclusion of dust reddening in the fitting has very little 
impact on these results and merely increases the scatter at lower 
redshift. 



different mass-to-light ratios (AI/L) for the various IMFs. 
As show in Maraston (1998) bottom-light IMFs have lighter 
M/L because of a lower amount of dwarf M stars whereas 
a top-heavy IMF is heavier because of the large amount of 
massive remnants. It is encouraging that the choice of IMF 
is felt in the fit, but an identification of the correct IMF 
based on the minimum xt is not possible as shown in Fig. 
1301 The large scatter stems mainly from the top-heavy IMF. 
Unfortunately, no comparison can be made with the true 
stellar mass for real galaxies. 

The median stellar mass recovery of passive galaxies depends 
little on the wavelength coverage used in the fit, but the scat- 
ter increases when fewer filter bands are used in the fitting 
(Fig. 1311 Tables [B3l and [B4)) . Masses are best recovered with 
the full wavelength coverage when the rest-frame near-IR is 
included. Stellar masses are recovered within 0.05 dex when 
reddening is not included in the fit. Similar to the ages, the 
inclusion of reddening affects the most at low redshift where 
masses are underestimated for filter setups lacking blue fil- 
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Figure 29. Median recovered stellar mass with 68% confidence levels for mono-metallicity wide setups as a function of redshift and 
therefore galaxy age (upper left to lower right). Metallicity in each panel increases from left to right. The input metallicity is solar. 
Missing points at Z=0.004 without reddening lie at — 1-51_„',.„ at z=2 and at — 3.31_„'c,, at z=3. Symbols are the same as in Fig. 1281 



ter bands. Towards higher redshift the exclusion of the rest- 
frame near-IR wavelengths in the fit introduces larger errors 
in the mass estimation. The information stored in the rest- 
frame near-IR is crucial in order to recover the right stellar 
mass. 

In summary, stellar masses of passive galaxies can be recov- 
ered within ~ 0.02 dex for a wide coverage in metallicities, 
SFHs and wavelength and the exclusion of reddening in the 
fitting. Restricting the template setup in metallicity has the 
strongest effect on the mass estimate. The dependence on 
wavelength coverage is small though not negligible and the 
scatter decreases when the rest-frame near-IR is included. 



4.3 Measuring the strength of the latest starburst 



As widely discussed in MIO and section 14711 a young stellar 
population overshines underlying older populations. Natu- 
rally, the question arises whether the derived stellar popu- 
lation properties refiect those of the latest burst. For this 



purpose we simulated passive galaxies located at redshift 1 
that consist of a large percentage of an old stellar popu- 
lation and a low percentage of a younger population, both 
with solar metallicity. The old component is 5 Gyr old, for 
the young component we take ages of 1 Myr, 10 Myr, 100 
Myr and 1 Gyr. We combine old and young components as 
99 and 1 % and 90 and 10 %. Each galaxy has a total mass 
of 10^^ Mq. Thus, recovered stellar masses of 10^*^ Mq and 
10^ Mq would refiect the mass of the burst. Ages of ~ 4.5 
(for 10% young) and '-^ 4.95 Gyr (for 1% young) would re- 
fiect the mass-weighted age of the galaxies. The fitting is 
carried out using the broadest wavelength coverage and a 
wide setup. Results for stellar mass and age are listed in 
Tabled 

Clearly, even just one percent of a relatively younger stellar 
population with respect to the dominant one has the power 
to hide 30 — 40% of the stellar mass locked up in old stars. 
When the population is just 1 Myr old it even hides nearly 
99%. For these ages, the fit renders precisely the mass of the 
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Figure 30. Median stellar mass recovery as a function of the IMF assumed in the fitting template for a wide template setup and 
UBVRIJHK+IRAC wavelength coverage. The input IMF is Salpeter. Errorbars are 68% confidence levels. Redshift increases from the 
top left to the bottom right. Symbols are the same as in Fig. 1281 



burst. When reddening is allowed in the fit, an age of 10 Myr 
is already young enough to hide nearly 99% of the remain- 
ing old stellar population. In both cases is the mass of the 
burst recovered within ~ 0.3 dex rather than the mass of the 
entire galaxy. The situation is very similar when the latest 
stellar population contributes 10%. The recovered mass is 
closer to the burst mass for the youngest bursts. Ages are 
very close to the age of the young component. When red- 
dening is involved, the age-dust degeneracy even causes the 
recovered mass and age to be lower than those of the burst 
component which clearly is a mere artefact stemming from 
the many degeneracies. Derived ages and masses are closest 
to that of the old population when the young population is 
1 Gyr old. 

Thus, a wide age gap between old and young stars is the 
biggest challenge in the fitting. This explains the worse re- 
covery of the physical properties of the mock star-forming 

galaxies at z=0 .5. 

ISerra fc Tragen (|2007l ') found in a similar exercise that the 



stellar ages inferred from Lick-indices measurements of two- 
component composite stellar populations are driven by the 
younger less massive component. 



4.4 Reddening la'ws 

In this section we explore the dependence of the fitting re- 
sult on the adopted reddening law. The question is: Can a 
better overall result be achieved by considering several red- 
dening laws or is using just one - which makes the fit faster 
- sufficient? 

Since we do not add dust reddening to passive galaxies, we 
investigate the case only for star-forming galaxies. Naively 
one would expect the Calzetti reddening law to provide the 
best fit and most accurate result for all reddened objects 
since the reddening prescription for the semi-analytic galax- 
ies is based on it (section [5]). We find that the best mass 
estimate is achieved when no reddening is used in the fit- 
ting. This is also true at higher redshift where galaxies are 
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Figure 31. Median differences between true and recovered stellar mass as function of photometric filters used in the model fitting, at 
redshifts 0.5, 1, 2 and 3. The filter setup is varied from left to right in each panel, redshift increases from the top left to the bottom right. 
Given are the 68% confidence levels for each setup. Symbols are the same as in Fig. 1281 



dust reddened. 

The largest difference in mass recovery between the red- 
dening laws occurs at z — 0.5 where there is little red- 
dening anyway (Fig. [32}. The MW law by Allen and the 
SMC law give the worst mass recovery. This is also re- 
flected in the results for the best fit among all redden- 
ing laws. In particular, the SMC reddening curve is much 
ste eper and higher than t he Calzetti law (compare Fig. 7 
in iBolzonella et al.ll2000r ). On average, the exact type of 
reddening law is less important at high redshift but varia- 
tions in the q uality factors i mply larger effect s for individ- 
ual ob je cts. iPapovich et al.l 1120011); iForster Schreiber et al.l 
||2004): IPozzetti et all (|2007l '): iMuzzin et al.1 l|2009l ') aiid 
iMuzzin et al.l ( 20091 . b) also found the effect of different ex- 
tinction laws on stellar masses to be small. 
SFRs on the other hand are much more sensitive to the type 
of reddening law (Fig. I33|l . We can summarize this in two 
points: 
1) When the reddening is a fitting parameter and galax- 



ies are dust reddened, the fitting code is able to pick out 
the right reddening law for most objects, the Calzetti law in 
our case. Fitting without reddening leads to underestimated 
SFRs. 

2) When galaxies have little reddening (z=0.5 case) and red- 
dening is a free parameter in the fit, the age-dust degeneracy 
causes SFRs to be overestimated. SFRs are best recovered 
with no reddening in the fit in this case. 
In conclusion, stellar masses are best derived without red- 
dening in the fitting at each redshift when the SFHs cannot 
be matched. For the best SFR estimate on the other hand 
all reddening laws should be used in the fitting for high 
redshift, dust reddened objects. For objects without dust 
reddening, SFRs are best obtained without reddening in the 
fitting. These conclusions stress again that the simultane- 
ous recovery of masses and SFRs is very difficult and can 
be achieved only when the assumed SFH is the correct one, 
as we found at z=2 with inverted-r models and a proper 
parameter selection (MIO). 
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Figure 34. Left: Effect of photometric uncertainties for mock star-forming galaxies at z=0.5 for a wide template setup and broad 
wavelength coverage in the fit. From top to bottom: ages, stellar masses and star formation rates, respectively. Right: The same for mock 
passive galaxies. Red and black symbols refer to the cases with and without reddening in the fit. We show galaxies at redshift 0.5 because 
for these the differences from the true values are largest. 



4.5 The effect of photometric uncertainties 

Comparing the results obtained when using original magni- 
tudes and those scattered using a Gaussian erroio in the 
fit enables us to study the effect of photometric uncertain- 
ties. In order to single out this effect we only consider the 
case without dust reddening. We focus on the older galaxies 
with little or no on-going star formation (mocks at z=0.5) 
for which we find the largest deviations between true and 
recovered stellar population properties. Star-forming galax- 
ies are fit with the wide setup. Passive galaxies are fit with 
a solar metallicity SSP only. 

Fig. [34] (top left) shows that ages can be very discrepant, 
reaching 1.5 dex difference. Masses differ on average by 
0.08 ± 0.27 dex and up to ~ 0.7 dex (Fig. \M middle left). 
The overall trend of underestimating the masses of older 
galaxies with little on-going star formation remains unaf- 
fected by the photometric uncertainties. 
The mean difference between the derived SFRs is small for 



^■^ Original magnitudes were scattered with their three sigma 
photometric errors. 



the highest SFRs (see Fig. [Ml bottom left). At the low SFR 
end deviations are larger (up to 1 dex). 
Fig. [31] and Fig. [34] (right) shows that for passive galaxies 
the effect is small. The largest deviations in stellar mass are 
found at z=0.5 amounting to maximal ~ 0.2 dex. The inclu- 
sion of reddening is negligible. Ages are more affected in the 
reddening case, particularly at the youngest and oldest ages, 
because of the age-dust degeneracy. Older ages are hard to 
distinguish from each other in the fitting as the spectra are 
very similar. 

We further explored what effect the size of photometric er- 
rorbars has on results for stellar masses for both types of 
galaxies in the reddened and unreddened case when a wide 
wavelength coverage and wide template setup is used. For 
this we investigated the following for the mock star-forming 
galaxies: 1) using the 3 a photometric errors where a is ob- 
tained from the COSMOS and GOODS-South surveys to 
scatter the original magnitudes (instead of 3 cr with a = 0.1 
mag) and carrying out the fit with the survey photometric 
errors as described in section[2] 2) Using the 3 a photometric 
errors where cr = 0.1 to scatter the original magnitudes and 
carrying out the fit with a = 0.1 for each band. 3) Doubling 
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Figure 22. SFR recovery using inverted-r models for moclc star- 
forming galaxies at redshift 2 as a function of wavelength cover- 
age. The filter setup is varied from top to bottom. Red circles are 
with reddening, black dots are without reddening. 
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Figure 26. Recovered metallicities of mock passive galaxies at 
redshifts from 0.5 to 3. Black and red are without and with red- 
dening, respectively. 



and tripling the photometric errors in the fit when using sur- 
vey photometric errors and a = 0.1 mag photometric errors 
in the fit when magnitudes were scattered with 3 a errorbars 
where a = 0.1 mag. Case 3) is also investigated for the mock 
passive galaxies. 

We find the largest effect for mock star-forming galaxies at 
the lowest redshift for which the median oflfset between de- 
rived and true stellar mass can differ by up to ~0.15 dex 
in the reddened case and up to ~ 0.07 dex in the unred- 
dened case between the various options leaving masses still 
underestimated overall. Increasing the errorbars in the fit 
to double and triple the values plays only a small role. The 
effects are negligible at high redshift and for mock passive 
galaxies. Hence, uncertainties on the star formation histories 
dominate the results. 



5 COMPARISON TO THE LITERATURE 



Other works in the literature exist on this topic. lWuvts et al.l 
l|2009l . hereafter W09) address the efiiciency of SED-fitting 
for a smoothed particle hydrodynamical merger simulation 



of two disk galaxies before (disk phase), during (merger 
phase with triggered star formation) and after the merg- 
ing (spheroid phase) spanning a total time of 2 Gyr, but not 
fixed to a particular redshift. They construct the SEDs at 
various snapshots in time for different viewing angles, place 
them at redshifts between 1.5 and 2.9 and study the effects 
of SFH, dust attenuation, metallicity and AGN contribution 
on the recovery of age, reddening (E(B-V)), visual attenua- 
tion {Ay), stellar mass and SFR. Their SED-fitting uses solar 
metallicity templates for three SFHs, namely SSP, constant 
SFR and a r-model with r = 0.3 Gyr. The templates are 
based on a Salpeter IMF and BC03 stellar population syn- 
thesis code. Av varies between and 4 in steps of 0.2 but 
is only applied to the constant SFR and r-model template. 
Ages are constrained to be larger than 50 Myr and younger 
than the age of the Universe. The fitting is carried out with 
HyperZ and a Calzetti reddening law. 

Our semi-analytic mock galaxies show considerable star for- 
mation for z ^ 1 and thus compare to the disk (and merger 
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Figure 27. Recovered E(B-V) of mock passive galaxies at red- 
shifts from 0.5 to 3. 



Table 2. Derived ages and stellar masses for a 10^^ Mq galaxy 
composed of a 5 Gyr old population and a low percentage - 1 or 
10% - of a younger population. Burst characteristics are listed in 
the first column, while the 2nd and 3rd columns list the derived 
ages and masses. 



Burst 

no reddening 


derived age 


derived M* 

[Mq] 


1% IGyr 


3.75 Gyr 


10.85 


1% lOOMyr 


4.5 Gyr 


10.86 


1% lOMyr 


4.5 Gyr 


10.74 


1% IMyr 


15 Myr 


9.23 


10% IGyr 


2.75 Gyr 


10.83 


10% lOOMyr 


404 Myr 


10.42 


10% lOMyr 


8.7 Myr 


9.91 


10% IMyr 


6.6 Myr 


9.93 


Burst 

+ reddening 


derived age 


derived M* 

[Me] 


1% IGyr 


3.5 Gyr 


10.90 


1% lOOMyr 


4.5 Gyr 


10.86 


1% lOMyr 


7.9 Myr 


9.28 


1% IMyr 


13.8 Myr 


9.30 


10% IGyr 


2.75 Gyr 


10.83 


10% lOOMyr 


6.3 Myr 


9.69 


10% lOMyr 


8.7 Myr 


9.91 


10% IMyr 


6.6 Myr 


9.93 



phase) of WOio. For this we predominantly consider the 
wide setup with age constraint and the inverted-r setup 
(both with reddening) . We compare the results of the wide 
setup without reddening for our mock passive galaxies for 
^ 1 to their spheroid phase. For the purpose of compari- 
son, we define A as the difference between recovered and true 
valuqij analogue to WQ9. Negative values of A correspond 
to underestimation of the according quantity, positive val- 
ues to overestimation. As true age for the mock star-forming 
galaxies we assume the mass-weighted age. We list our me- 
dian values for A together with those of W09 in Table lEll 
WQ9 find slight overestimation of ages in the disk phase and 
underestimation in the merger phase. The ages we derive are 
underestimated at z = 1 and 2 and overestimated at z — 3 
for the wide setup with age constraint. Ages are more under- 
estimated towards lower redshift due to the wide range in 
population ages comprised within those galaxies (compare 
Fig.©. W09 seem to determine better ages as they consider 
only a small age-range, no metallicity effects and set an age 



^^ We do not distinguish our mock galaxies in disks and merg- 
ers although they might have undergone a merging event shortly 
before we observe them 
^* logarithmic values for age, mass and SFR 



constraint. We do not compare the age for the inverted-r 
models as those have a constant age which is more compa- 
rable to the oldest age. While E(B-V) is underestimated in 
all phases in W09, our derived reddening values are overes- 
timated at lower redshift and very well recovered at higher 
redshifts. Our scatter is smaller than that of W09 at all red- 
shifts for the two setups. Stellar masses at 2: ^ 2 are equally 
underestimated as in W09's disk and merger phase. Our 
mass estimate for z ^ 2 improves if we only consider objects 
in the same mass range as WQ9 (log M*> 9.8). However, 
masses are best recovered with the inverted-r setup where 
they show almost no offset with smaller scatter. The me- 
dian stellar mass offset is larger for galaxies at redshift 1. In 
both disk and merger phase W09 underestimate the SFRs. 
We overestimate the SFRs at intermediate redshifts. At z=3 
the median offset in SFR is small. Our scatter is somewhat 
smaller than that of W09. 

Since we did not put reddening onto the mock passive galax- 
ies, we only compare the fit values for the stellar masses, 
ages and SFRs with W09. Even in the case with reddening, 
a value of E(B-V)=0 is correctly recovered for most objects. 
Both ages and stellar masses are slightly overestimated in 
our case, while they are underestimated in W09. However, 
in each case, offsets and scatter are very small. This confirms 
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Figure 32. Average mass recovery as a function of reddening 
law at redshift 0.5 and 2. The dotted black line shows the mass 
recovery when no reddening is used in either spectra or fitting. 
The dashed thick black line shows the result when the best fit out 
of all reddening laws is chosen for each object. The coloured thin 
lines represent the results applying only one reddening law on the 
reddened spectra. Errorbars refer to one standard deviation. 



Figure 33. Average SFR recovery as a function of reddening law 
at each redshift where redshift increases from the top left to the 
bottom right. We show solutions for a wide setup only. Colours 
and lines are the same as in Fig. 1321 Note that the data point at 
the highest SFR at z=0.5 and the two points at the highest SFR 
at z=2 consist of only one object each and are merely shown for 
completeness. Errorbars refer to one standard deviation. 



that stellar masses of passive galaxies can be much better 
determined by SED-fitting than those of star-forming galax- 
ies. 

While W09's reddening and SFRs are underestimated in the 
spheroid phase, both quantities are very well recovered for 
our passive galaxies. We assign this to the fact that the pas- 
sive galaxies we simulated are single bursts while galaxies in 
the spheroid phase in W09 are clearly precessed by consid- 
erable star formation and thus still suffer from overshining. 
For a direct comparison we reproduce Fig. 8 of W09 for the 
basic trends and correlations between these physical proper- 
ties of galaxies and show our results for mock star-forming 
galaxies in Fig. lEll As in W09, in summary, those corre- 
lations are: 1) Age and stellar mass show a strong corre- 
lation, i.e. an underestimation in age will lead to an un- 
derestimation in mass and vice versa. 2) This is anticor- 
related with dust reddening, the so called age-dust degen- 
eracy. 3) It is also anticorrelated with metallicity, the so 
called age-metallicity degeneracy. 4) SFR and dust redden- 
ing are tightly correlated, consequently age and SFR are 



anticorrelated. Furthermore, we find evolution in these with 
redshift. Ages and stellar masses are increasingly underesti- 
mated whereas reddening and SFRs are increasingly overes- 
timated t o wards low redshift. 

I Lee et al.l ([20091) focus on the robustness of the derivation 
of physical parameters of Lyman break galaxies at redshifts 
~ 3.4, 4.0 and 5.0 (U-, B-, and V-dropouts, respectively). 
They create mock galaxies from a semi-analytical model us- 
ing BC03 stellar population models with a Chabrier IMF 
and apply the Lyman break selection criterium to identify 
star-forming objects at high redshift. Their SED-fitting em- 
ploys Chabrier IMF BC03 templates with three metallicities 
(0.2, 0.4 and 1 Zq) and exponentially declining SFRs with 
r = 0.2—15 Gyr. The Calzetti law is used for dust extinction. 
They find that the means of the intrinsic and reproduced 
mass distributions differ between 19 and 25% (for U- and 
V- dropouts, respectively), such that recovered masses are 
underestimated. The corresponding difference for our z = 3 
objects is ~ 17% (or 0.08 dex) using the wide setup with 
reddening and ~ 15% (or 0.07 dex) when the minimum age 
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is 0.1 Gyr. For the inverted-r setup the difference between 
the means of true and recovered stellar mass distributions 
is < 1% ( 004 d ex). 

iLee et alj (|2009J) also report larger discrepancies for mean 
age and SFR in comparison to the true values, mean SFRs 
(averaged over 100 Myr) are underestimated by ~ 62% and 
mass-weighted mean ages are overestimated by about a fac- 
tor of 2. They assign this behaviour to two effects: the mis- 
match between real and template SFHs and the overshining 
effect of a young stellar population. Furthermore, they ac- 
knowledge the influence o f the age-dust degeneracy on the 
derived SFRs. iLee et all l|2009l ) try to solve the SFH mis- 
match in using two-component templates consisting of two 
time-separate burst of star formation with different values 
of T. However, we have shown in MIO and the previous 
sections that using inverted-r models recovers the mass for 
mock star-forming galaxies at z = 2 and 3 perfectly. SFRs 
are slightly overestimated and show a larger scatter at the 
low SFR end where the exponentially increasing SFH of the 
template is not matched by the simulated galaxies. Based 
on the SFHs of the mock star-forming galaxies as shown in 
Fig. Owe believe that the best model to recover the masses 
and SFRs of star-forming objects at 2; > 3 is the inverted-r 
template (using appropriate formation redshifts and ages). 
Because of the fixed age, the overshining by young compo- 
nents is ignored in the fit. Derived ages will therefore always 
be older than input luminosity- or mass-weighted ages and 
compare better to th e oldes t age. 

iLonghetti fc Saraccd (12009) study the systematics involved 
in the mass estimates from SED-fitting of mock early-type 
galaxies between redshift 1 and 2. They simulate early-type 
galaxies with Salpeter IMF solar metallicity BC03 models 
of exponentially declining SFH (r = 0.6 Gyr) and ages be- 
tween 1.7 and 4.2 Gyr. Furthermore, they add secondary 
bursts at different times with mass fractions of 5 and 20% 
and star formation time scale of r = 0.1 Gyr. Dust obscu- 
ration is considered to be Ay ~ 0, 0.2 and 0.5 using the 
Calzetti law. SED-fits are performed using hyperz and so- 
lar metallicity r- models of BC03 with r = 0.1,0.3,0.6, 1.0 
Gyr. They also test the effects of usin g differ ent stellar pop- 
ulation models. ILonghetti fc Saraccd (|2009l ) conclude that 
overall stellar masses of early-type galaxies cannot be esti- 
mated better than within a factor of 2-3 of the true value 
at fixed known IMF. We find that the stellar masses of the 
mock passive galaxies in our work at z=l can be recovered 
within a factor of ~ 1.05 (~ 0.02 dex) at known IMF. Includ- 
ing uncertainties on the IMF (but only considering among 
Kroupa, Chabrier and Salpeter) masses are recovered within 
0.2 dex. Our result is better probably just because we have 
really zero on-going star fo rmation in our passive galaxies 
while ILonghetti fc Saraccd (2009) by considering a r model 
and second bursts keep a small amount of star formation, 
which confuses the derived properties for the reasons we am- 
ply dis cussed above. 

Lastly, iBolzonella et al.l (|2009l ) find a typical total uncer- 
tainty caused by different choices of reddening law, metal- 
licities, SFHs and stellar population models in stellar mass 
estimates of a ~ 0.2 dex. 



6 HOMOGENISING DERIVED PROPERTIES 
VIA SCALING RELATIONS 

For many purposes, combining results from different authors 
and data sets is required. In this case, homogeneity of the 
analysis method is essential. However, different models, tem- 
plates, IMFs and fitting procedures are used in the literature 
and we have demonstrated in the previous sections their im- 
pact on the results. Here, we have the opportunity to obtain 
'scaling relations' which allow the transformation of the de- 
rived physical properties from one set of fitting parameters 
to another. Based on the previous sections we have derived 
such relations for stellar mass for a variety of parameter 
choices using a least squares fit. They are listed in table [3] 
for star- forming galaxies and in table |4] for passive galaxies. 
In order to gain statistical robustness we use results for the 
entire merger tree. Some examples are shown in Fig. \T^ as 
solid green and dashed magenta lines. 

Note that these relations provide information only about 
statistical differences, the difference in stellar mass between 
various fitting setups for a single object can significantly 
deviate from this. 



7 SUMMARY AND CONCLUSIONS 

We analyse the problem of recovering galaxy stellar popu- 
lation properties - such as ages, star formation rates, stellar 
masses - through broad-band spectral energy distribution 
fitting as a function of the various parameters of the fit, 
namely: star formation history, metallicity, age grid, the 
Initial Mass Function (IMF), reddening law, wavelength 
coverage and filter setup used in the fitting. This is a 
crucial question in galaxy formation and evolution as most 
conclusions that are derived from data are fully based 
on the galaxy properties that are obt ained through this 
method (see e.g. the recent review bv IShaplevI I2OIII ). As 
test particles we use mock galaxies whose properties are 
known and well defined. We consider both star-forming and 
passive galaxies and a large redshift range, from 0.5 to 3. 
Mock star-forming galaxies are taken from a semi-analytical 
galaxy formation model (GallCS) which uses as input the 
same stellar population model (by Maraston 2005) used 
for the fitting templates, but we also experiment with 
other stellar population models for both the fitting and the 
semi-analytic galaxies (results for this are fully discussed in 
the Appendix). Mock passive galaxies are simulated using 
solar metallicity simple stellar populations. We then treat 
the mocks as real observed galaxies, i.e. we obtain their 
observed-frame magnitudes, apply SED-fitting with various 
templates and compare the properties derived from the 
fitting to their true values. 

This work complements an earlier paper (Maraston et 
al. 2010) where we discuss the problem of deriving star 
formation rates and masses for real 2: ~ 2 star-forming 
galaxies right at the peak of the cosmic star formation rate. 
There we found that the adopted star formation history 
affects the results, as well as the priors that are set on 
the galaxy formation ages. This happens because the age 
parameter becomes very poorly constrained for galaxies 
with on-going star formation, due to the overshining of 
older generations by the youngest stars, even when these 
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Table 3. Scaling relations for mock star-forming galaxies in the 
form of iooM* . „ , , = ai + ao *3: where x stands for iooM* 
of the various fitting setups. The coefficients a\ and 02 represent 
the unreddened case, h\ and 62 represent the reddened case. 



Table 4. Scaling relations for passive galaxies in the form of 
loqM* -,„, , = ai + a2 * X where x stands for loqM* of the 
various fitting setups. The coefficients ai and 02 represent the 
unreddened case, fei and 62 represent the reddened case. 
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0.5159 
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0.0243 


1.0095 


0.3167 


0.9821 


1.00 


wide Chabrier 


0.0913 


1.0042 


0.1660 


0.9974 


2.00 


wide Chabrier 


0.1477 


1.0007 


0.2017 


0.9960 


3.00 


wide Chabrier 


0.1612 


1.0001 


0.1615 


1.0001 


0.50 


wide Kroupa 


-0.0188 


1.0124 


0.0795 


1.0022 


1.00 


wide Kroupa 


0.0785 


1.0035 


-0.7200 


1.0731 


2.00 


wide Kroupa 


0.1289 


0.9996 


0.1785 


0.9952 


3.00 


wide Kroupa 


0.1217 


1.0004 


0.1214 


1.0005 


0.50 


wide Zq 


0.1568 


0.9865 


-0.1059 


1.0129 


1.00 


wide Zq 


0.0227 


0.9983 


-0.0418 


1.0051 


2.00 


wide Zq 


0.0773 


0.9941 


-0.0145 


1.0022 


3.00 


wide Zq 


0.0006 


0.9999 


0.0003 


1.0000 


0.50 


only-T 


-0.0161 


1.0012 


-0.2548 


1.0261 


1.00 


only-T 


0.0442 


0.9966 


-1.0724 


1.0948 


2.00 


only-T 


0.0592 


0.9963 


0.0013 


1.0005 


3.00 


only-T 


-0.0512 


1.0060 


-0.0336 


1.0046 


0.50 


SSPs 


-0.0401 


1.0043 


0.0877 


0.9914 


1.00 


SSPs 


-0.0498 


1.0039 


0.3950 


0.9628 


2.00 


SSPs 


0.2261 
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0.0685 
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0.0007 


0.9854 


3.00 


wide BC03 
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make up just a few percent of the total stellar mass. As a 
main result of that paper we put forward a star formation 
history that allows for a very good determination of star 
formation rates and masses of star-forming galaxies. This 
is an exponentially-increasing star formation with high 
formation redshift and a star formation timescale t ~ 0.4 
Gyr. We also find that normally adopted T-models underes- 
timate the stellar mass and overestimate the star formation 
rates. In this paper we quantify these various offsets using 
simulated galaxies (Appendix [B]). 

Our results for mock star-forming galaxies can be sum- 
marised as follows: 

- The match between template and real SFH is crucial for 
a simultaneous determination of the galaxy physical pa- 
rameters. For example, at high redshift - when galaxy ages 
span a confined range - stellar masses are very reasonably 



determined - within 0.1 dex - with a variety of template 
choices. However, this comes as a compensating effect of 
other properties being underestimated/overestimated. 

- When there is a SFH mismatch, the direction is typically 
at underestimating ages and stellar masses and overestimat- 
ing reddening and SFRs, both due to overshining from the 
youngest stellar generations and the age-dust degeneracy. 
We quantify these effects as functions of adopted fitting 
setup. 

- Inverted-T models with the exact same prescriptions 
as in MIO allow for a virtually perfect mass and SFR 
determination of 2; ~ 2 — 3 galaxies. 

- When galaxies are older and have residual star formation, 
as is the case for the semi-analytic galaxies at lower 
redshifts, the larger age span and the age-dust degeneracy 
cause the mass (and the age) to be severely underestimated 
(by up to 0.6 dex) using standard templates composed of a 
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variety of star formation histories. 

- An efficient tricfc to reduce this effect is to switch off 
reddening in the fit as this helps avoiding unrealistically 
young and dusty solutions. Reddening and star formation 
rates should then be obtained through a separate fit. 

- A similar effect is obtained by constraining the minimum 
age in the fitting as already done in the literature. 

- Ages derived from the best fit are best compared to 
mass-weighted ages. They are generally underestimated. 

- Metallicity effects play a secondary role. 

- The exact type of reddening law can be correctly identified 
when dust reddening is present in the galaxies even if at 
high redshift differences in stellar mass arising from the use 
of various reddening laws are negligible. 

- Photometric uncertainties alone produce an uncertainty 
of on average 0.08 dex in stellar mass and SFR and of 0.07 
dex in age. 

- At all redshifts, a wavelength coverage from the rest-frame 
UV to the rest-frame near-lR allows to obtain the most 
robust results. 

- Uncertainties stemming from different stellar population 
models amount to ~ 0.2 dex in stellar mass (~ 0.4 dex in 
the reddened case at z=0.5) and ~ 0.3 dex in age (~ f.2 
dex in the reddened case at z=0.5). 

For mock passive galaxies our main results are: 

- Stellar masses can be recovered very well. The dispersion 
from the true value amounts on average to less than 0.05 
dex. 

- Metallicity effects become important for aged passive 
galaxies. The use of various metallicities in the fitting 
appears to be the most secure option. 

- The effect from photometric errors alone on the ages and 
stellar masses is ~ 0.2 dex. 

- The wavelength dependence of the best fit ages for old 
passive galaxies is smaller than in the case of star-forming 
galaxies, but the scatter is substantially reduced by includ- 
ing the rest-frame near-IR in the fit. 

We conclude that the stellar population parameters of star- 
forming and passive galaxies can be reasonably well deter- 
mined provided one uses the right setup and wavelength cov- 
erage. Although degeneracies between age, dust and metal- 
licity can have a significant effect, the mismatch between 
template and real SFH dominates at low redshift. Because 
of this and of the larger spread in stellar ages the stellar 
population properties of galaxies at high redshift are better 
recovered than those at low redshift. Based on our exten- 
sive study we conclude that at z = 0.5 and 1 stellar pop- 
ulation parameters are best recovered with a wide fitting 
setup, setting a minimum age of O.f Gyr in the fitting and 
excluding reddening from the fitting parameters. This solu- 
tion is acceptable, though not optimal, and future work will 
be devoted in envisaging a good analytical description for 
this type of galaxies. 

For z ^ 2 — 3 inverted-r models give the best results for 
both mass and star formation rates. 

In all cases, a wavelength coverage from UV to near-IR rest- 
frame wavelengths in the fitting ensures the best results, 
though in some cases it is possible to find alternative filter 
band setups that can give reasonable results. We quantify 
these effects which will be useful for the planning of surveys 



and observational proposals. 

Finally, due to the variety of assumptions made in the lit- 
erature regarding fitting methods, stellar population mod- 
els and adopted filters, caution is required whe n compar- 
ing SED-fitting results between different studies IjLee et al.l 
(2009)). In order to ease these comparisons we provide scal- 
ing relations for the stellar mass that allow transformation 
from one set of fitting parameters and population models to 
another. 
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Figure CI. Ages derived from SED-fitting with different stel- 
lar population synthesis models (M05 and BC03) for mock star- 
forming galaxies based on M05 photometry. A wide setup and 
wavelength coverage was used. Black refers to the unreddened 
case, red to the case including reddening. 



Figure C2. Same as Fig. ICll for the derived metallicities. Red 
histograms stand for BC03-based results, black shaded histograms 
for M05-based results. 
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APPENDIX A: OVERVIEW OVER FITTING 
SETUPS 

We summarise the fitting setups explored througliout tliis 
work in Table \K^ 



APPENDIX B: OVERVIEW OVER FITTING 
RESULTS 

We summarise median offsets and 68% confidence ranges for 
age, E(B-V), stellar mass and SFR in Tables [Bl] - |B4] for a 
selected number of fitting setups. 



APPENDIX C: FITTING BC03 TEMPLATES 
TO M05 GALAXIES 

It is interesting to see the effect of fitting galaxies adopting 
a certain stellar evolution with templates adopting another 
prescription. To this aim we fitted the semi-analytic 
star-forming galaxies whose spectra adopt the M05 stellar 
population models with templates based on the BC03 
stellar population models. Results are shown in Figs. ICll - 
IC5I Median offsets and 68% ranges are given in Tables IBll 
and lB2l Ages derived from BC03 templates are on average 
older than those from the M05-based templates (Fig. IC1|I . 
Median offsets differ by at least ~ 0.2 dex and can differ 
by up to ~ 1.2 dex in the reddened case at low redshift. 
This is clearly a TP-AGB effect for ages between ~ 0.2 




0.4 0.8 0.4 O.E 

E(B-V) M05 wide setup 

Figure C3. Same as Fig. ICll for the comparison of derived dust 
attenuation. 



and 2 Gyr (M05, M06). M05 templates contain more flux 
in the rest-frame near-IR caused by the TP-AGB. BC03 
templates have a lower contribution from this phase, hence 
an older age of these templates is required to match the 
near-IR flux. At ages ^ 2 Gyr (and metallicities ^ Zq) MQ5 
models are bluer than BC03 models because of the warmer 
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Table Al. Overview of all fits carried out with various template and filter setups for simulated galaxies at redshift 0.5, 1, 2 and 3. Fits that 
involve constraining or rebinning the age grid were only carried out for mock star-forming galaxies. Fits with inverted-r models as a function 
of wavelength dependence were only carried out for mock star-forming galaxies at z=2. 
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Red Giant Brancto (RGB) in the Frascati stellar tracks 
used in M05. The optical/near-IR ratio is most affected by 
these differences in the stellar tracks, for example the V-K 
colour is 0.08 mag bluer in the Frascati tracks (see M05 
for more details). This is compensated by younger ages 
obtained with BC03 templates. 

For the youngest ages (at low redshift) differences between 
M05 and BC03-based models stem from the different input 
stellar tracks for young populations, namely Geneva for 
M05 and Padova for BC03. Geneva tracks in particular are 
redder than Padova tracks around 10 Myr, an effect due to 
the assumed mass-loss and post-MS evolution. This implies 
a redder M05 model at very young ages which may drive 
BC03 templates to fit with older ages. 

Fig. IC2I shows that the best fit solution obtained from 
BC03-based templates is on average more metal-rich than 
that from M05-based templates. This in combination 
(and degeneracy) with age compensates the lower fuel in 
TP-AGB and red supergiant stars in the model. 
At low redshift the reddening is better determined with 
BC03-based templates which on average provide less 



^^ most likely caused by a different treatment of the mixing 
length in these tracks in comparison to the Padova tracks used in 
BC03, see M05 Fig. 9 



reddening (Fig. [Cl median offset of AE{B - V) = 0.12). 
Obviously, this is because ages are older and metallicities 
are higher. However, as the model galaxies evolve a la M05, 
this better agreement is contrived as it originates from the 
compensation of wrong determination of parameters. At 
high redshift, the reddening values from both models agree 
very well. 

Masses derived with templates based on BC03 stellar 
population models are on average higher than M05-based 
masses. Median offsets differ by ~ 0.2 to ~ 0.4 dex. The 
mass for M* > lO^'' M© at z ^ 1 is higher by up to 0.6 
dex (Fig. IC4II . This is in agreement with previous findings 
of M06 and Pozzetti et al. (2007). In the unreddened case 
this clearly points out the different TP-AGB treatment, 
particularly at high redshift where a significant amount of 
stars is found in this evolutionary phase. In the reddened 
case the age-dust degeneracy dominates. At z ^ 1 the lower 
TP-AGB in the BC03 models causes just the right amount 
of overestimation in age to correctly predict stellar masses. 
Given that the physical effect is to underestimate the mass, 
the larger masses derived with BC03 templates are closer to 
the correct mass but for the wrong reason. At 2 ^ 2 BC03 
templates overestimate the masses at the high-mass end for 
this reason. 
The SFRs derived with both models agree very well at low 



© 0000 RAS, MNRAS 000, 000-000 



Recovering galaxy properties from SED-fitting 41 



Table Bl. Median offsets and ranges including 68% of the solutions for A 
star- forming galaxies at 2 = 0.5 derived with various fitting setups. SFRs of 
often causing large negative values for A log SFR. 



log age, A E{B-V), A log M* and A log SFR of mock 
zero are set to 0.001 in order to compute the logarithm 



setup 


A log age 
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A log M* 




A log SFR 




z=0.5, no reddening 


median 


range 




median 
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median 
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wide Salpeter 
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wide Kroupa 


-0.39 
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wide Chabier 
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-2.31 to -0.11 


wide top-heavy 
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0.00 
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Z=0.004 
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UBVRH-IRAC 
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0.00 


0.00 to 0.00 
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-0.01 


-2.31 to 0.10 


UBVRIJHK 


-0.26 


-0.80 to 0.10 


0.00 


0.00 to 0.00 


-0.21 


-0.53 to 0.03 


-0.04 


-1.51 to 0.10 


ugriz 


-0.31 


-0.74 to 0.03 


0.00 


0.00 to 0.00 


-0.23 


-0.47 to -0.03 


0.02 


-0.34 to 0.15 


RIJHK-flRAC 


-0.23 


-1.00 to 0.19 


0.00 


0.00 to 0.00 


-0.20 


-0.62 to 0.07 


-0.73 


-3.29 to 0.30 


BRIK 


-0.20 


-0.87 to 0.19 


0.00 


0.00 to 0.00 


-0.19 
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-0.03 


-2.10 to 0.12 


wide, age ^0.1 Gyr 


-0.33 


-0.89 to 0.07 


0.00 


0.00 to 0.00 


-0.30 


-0.54 to -0.04 
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-2.24 to 0.10 


wide BC03 
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-0.28 to 0.04 


z=0.5, with reddening 


wide Salpeter 
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wide Kroupa 
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-2.76 to 
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wide top-heavy 
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ugriz 
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-2.76 to 
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0.06 to 0.66 


-0.51 
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-2.83 to 
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BRIK 
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-2.89 to 
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0.39 


0.07 to 0.84 


-0.62 


-1.11 to -0.37 
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-3.41 to 1.38 


wide, age ^0.1 Gyr 


-0.87 


-1.36 to 


-0.03 


0.14 


0.00 to 0.35 


-0.38 


-0.59 to -0.08 


0.12 


-2.38 to 0.71 


wide BC03 


-0.59 


-2.71 to 0.06 


0.12 


0.00 to 0.54 


-0.24 


-0.93 to 0.02 


0.00 


-2.87 to 0.45 



redshift (Fig. IC5|I . at high redshift BC03-based template 
setups give lower SFRs on average in the reddened case. 



APPENDIX D: FITTING M05 AND BC03 
TEMPLATES TO PEGASE-BASED 
SEMI-ANALYTIC GALAXIES 



Here we investigate the reverse exercise as in Sec- 
tion [21 namely we fit M05 or BC03 templates to 
semi-analytic galaxies b a sed on Pegase templates 
IjFioc fc Rocca-Volmerangd [l997|). Results are shown 
in Figs. ID1I - ID41 Median offsets and 68% ranges are listed 
in Table iDll 
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Table B2. Same as Table iBll for mock star-forming galaxies at 2 = 2.0. 



setup 


A log age 




A E(B-V) 




A log M* 




A log SFR 




z=2, no reddening 


median 


range 


median 


range 


median 


range 


median 


range 


wide Salpeter 


-0.08 


-0.72 to 0.15 


0.00 


0.00 to 0.00 


-0.10 


-0.29 to 0.00 


0.01 


-1.52 to 0.10 


only-T 


-0.06 


-0.44 to 0.15 


0.00 


0.00 to 0.00 


-0.05 


-0.43 to 0.01 


0.09 


0.00 to 0.19 


inverted-r 


-0.47 


-0.56 to -0.31 


0.00 


0.00 to 0.00 


0.02 


-0.06 to 0.09 


0.10 


-0.03 to 0.18 


wide Kroupa 


-0.19 


-0.96 to 0.06 


0.00 


0.00 to 0.00 


-0.37 


-0.63 to -0.20 


-0.20 


-3.33 to -0.08 


wide Chabrier 


-0.13 


-0.96 to 0.04 


0.00 


0.00 to 0.00 


-0.29 


-0.66 to -0.19 


-0.25 


-3.26 to -0.12 


wide top-heavy 


-0.17 


-1.14 to 0.24 


0.00 


0.00 to 0.00 


0.39 


-0.68 to 0.73 


-3.74 


-4.17 to -0.69 


Z=0.004 


-0.03 


-0.70 to 0.15 


0.00 


0.00 to 0.00 


-0.11 


-0.24 to -0.04 


-0.02 


-1.52 to 0.08 


Z=0.04 


-0.39 


-1.29 to -0.19 


0.00 


0.00 to 0.00 


-0.16 


-0.79 to -0.06 


0.04 


-3.33 to 0.15 


UBVRI-I-IRAC 


-0.14 


-0.96 to 0.15 


0.00 


0.00 to 0.00 


-0.12 


-0.62 to -0.01 


0.03 


-3.26 to 0.14 


UBVRIJHK 


-0.16 


-1.03 to 0.15 


0.00 


0.00 to 0.00 


-0.11 


-0.62 to 0.01 


0.00 


-3.02 to 0.09 


ugriz 


-0.21 


-1.53 to 0.44 


0.00 


0.00 to 0.00 


-0.18 


-1.00 to 0.43 


0.00 


-3.88 to 0.13 


RIJHK-flRAC 


-0.11 


-0.88 to 0.12 


0.00 


0.00 to 0.00 


-0.12 


-0.35 to -0.01 


-0.02 


-3.33 to 0.10 


BRIK 


-0.09 


-0.83 to 0.23 


0.00 


0.00 to 0.00 


-0.10 


-0.49 to 0.04 


0.05 


-3.02 to 0.15 


wide, age ^0.1 Gyr 


0.00 


-0.22 to 0.24 


0.00 


0.00 to 0.00 


-0.10 


-0.21 to 0.00 


0.04 


-0.05 to 0.10 


wide BC03 


0.20 


-0.19 to 0.44 


0.00 


0.00 to 0.00 


0.12 


-0.18 to 0.24 


0.01 


-0.08 to 0.08 


z=2, with reddening 


wide Salpeter 


-0.15 


-0.89 to 0.14 


0.00 


-0.01 to 0.06 


-0.12 


-0.51 to -0.03 


0.06 


-0.35 to 0.44 


only-T 


-0.07 


-0.96 to 0.14 


0.01 


-0.01 to 0.06 


-0.06 


-0.51 to 0.01 


0.12 


-0.09 to 0.61 


inverted-r 


-0.47 


-0.56 to -0.31 


0.01 


0.00 to 0.10 


0.01 


0.06 to -0.05 


0.17 


-0.04 to 0.40 


wide Kroupa 


-0.22 


-1.07 to 0.11 


0.02 


-0.02 to 0.07 


-0.38 


-0.66 to -0.21 


-0.09 


-3.49 to 0.35 


wide Chabrier 


-0.20 


-1.04 to 0.02 


0.03 


-0.02 to 0.06 


-0.33 


-0.74 to -0.22 


-0.17 


-3.33 to 0.25 


wide top-heavy 


-0.08 


-1.54 to 0.33 


0.09 


0.00 to 0.19 


0.39 


-1.05 to 0.67 


-0.73 


-4.12 to -0.08 


Z=0.004 


-0.09 


-0.82 to 0.16 


0.02 


-0.01 to 0.10 


-0.12 


-0.47 to -0.04 


0.09 


-0.24 to 0.60 


Z=0.04 


-0.39 


-1.28 to -0.18 


0.04 


-0.03 to 0.13 


-0.16 


-0.64 to -0.07 


0.17 


-0.21 to 0.83 


UBVRH-IRAC 


-0.29 


-1.22 to 0.12 


0.04 


-0.02 to 0.10 


-0.19 


-0.73 to -0.05 


0.11 


-3.26 to 0.55 


UBVRIJHK 


-0.22 


-1.55 to 0.16 


0.03 


0.00 to 0.06 


-0.15 


-0.99 to 0.00 


0.00 


-3.76 to 0.22 


ugriz 


-0.92 


-2.04 to 0.22 


0.02 


-0.03 to 0.10 


-0.61 


-1.27 to 0.23 


-0.22 


-3.98 to 0.39 


RIJHK-I-IRAC 


-0.26 


-1.07 to 0.14 


0.00 


-0.04 to 0.13 


-0.14 


-0.46 to -0.03 


-0.05 


-3.65 to 0.56 


BRIK 


-0.13 


-1.26 to 0.28 


0.00 


-0.02 to 0.06 


-0.10 


-0.72 to 0.08 


0.09 


-2.69 to 0.49 


wide, age ^0.1 Gyr 


-0.03 


-0.31 to 0.19 


0.00 


-0.01 to 0.05 


-0.11 


-0.22 to -0.03 


0.06 


-0.10 to 0.21 


wide BC03 


0.18 


-0.77 to 0.42 


0.00 


-0.03 to 0.06 


0.12 


-0.43 to 0.25 


0.03 


-0.20 to 0.40 



Overall, we find the same trends as in section 14] ages and 
stellar masses are underestimated, reddening and SFRs are 
overestimated. 

At each redshift, M05-based templates underestimate the 
age more than BC03-based templates which is evident in 
a clear offset at z=2 in Fig. IDll Median offsets between 
recovered and mass-weighted age differ by ~ 0.3 dex in 
the unreddened case and by ^ 0.5 dex in the reddened 



case. BC03 templates recover the age of the Pegase galaxies 
similarly well as M05 templates recovered the age of 
M05 galaxies in Fig. [S] For M05 galaxies BC03 templates 
overestimate the age. 

This effect is carried through to the derived stellar masses. 
While fitting with M05 templates leads to underestimated 
masses (with a clear offset), BC03 templates recover masses 
at high redshift very well (Fig. ID2[I . At low redshift masses 
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Table B3. Same as Table iBll for mock passive galaxies at z = 0.5. 



setup 


A log age 




A E(B-V) 




A log M* 




A log SFR 




z=0.5, no reddening 


median 


range 


median 


range 


median 


range 


median 


range 


wide Salpeter 


0.03 


-0.02 to 0.15 


0.00 


0.00 to 0.00 


0.01 


-0.04 to 0.05 


0.00 


0.00 to 0.00 


only-T 


0.03 


0.00 to 0.10 


0.00 


0.00 to 0.00 


0.01 


-0.04 to 0.05 


0.00 


0.00 to 0.00 


only SSPs 


0.01 


-0.03 to 0.06 


0.00 


0.00 to 0.00 


0.01 


-0.05 to 0.04 


0.00 


0.00 to 0.00 


only solar SSP 


0.01 


-0.03 to 0.03 


0.00 


0.00 to 0.00 


0.00 


-0.05 to -0.03 


0.00 


0.00 to 0.00 


wide Kroupa 


0.05 


0.00 to 0.15 


0.00 


0.00 to 0.00 


-0.10 


-0.16 to -0.06 


0.00 


0.00 to 0.00 


wide Cliabrier 


0.05 


0.00 to 0.15 


0.00 


0.00 to 0.00 


-0.12 


-0.18 to -0.06 


0.00 


0.00 to 0.00 


wide top-heavy 


0.08 


0.00 to 0.22 


0.00 


0.00 to 0.00 


1.14 


1.02 to 1.26 


0.00 


0.00 to 0.00 


Z=0.004 


0.39 


0.15 to 0.75 


0.00 


0.00 to 0.00 


0.03 


-0.32 to 0.50 


0.00 


0.00 to 0.00 


Z=0.04 


-0.30 


-0.39 to -0.24 


0.00 


0.00 to 0.00 


-0.27 


-0.44 to -0.18 


0.00 


0.00 to 0.00 


UBVRI+IRAC 


0.05 


0.00 to 0.23 


0.00 


0.00 to 0.00 


0.03 


-0.03 to 0.08 


0.00 


0.00 to 0.00 


UBVRIJHK 


0.04 


-0.02 to 0.19 


0.00 


0.00 to 0.00 


0.02 


-0.03 to 0.10 


0.00 


0.00 to 0.00 


ugriz 


0.02 


-0.18 to 0.31 


0.00 


0.00 to 0.00 


-0.01 


-0.12 to 0.23 


0.00 


0.00 to 0.00 


RIJHK+IRAC 


0.00 


-0.18 to 0.26 


0.00 


0.00 to 0.00 


-0.04 


-0.24 to 0.09 


0.00 


0.00 to 3.00 


BRIK 


0.06 


-0.12 to 0.28 


0.00 


0.00 to 0.00 


0.02 


-0.18 to 0.13 


0.00 


0.00 to 2.30 


wide BC03 


0.05 


-0.03 to 0.44 


0.00 


0.00 to 0.00 


0.07 


0.00 to 0.32 


0.00 


0.00 to 2.30 


z=0.5, with reddening 


wide Salpeter 


0.02 


-0.09 to 0.12 


0.00 


0.00 to 0.07 


0.01 


-0.07 to 0.08 


0.00 


0.00 to 0.00 


only-r 


0.00 


-0.09 to 0.07 


0.00 


0.00 to 0.06 


0.00 


-0.08 to 0.05 


0.00 


0.00 to 0.00 


only SSPs 


0.01 


-0.09 to 0.10 


0.00 


0.00 to 0.07 


0.02 


-0.05 to 0.09 


0.00 


0.00 to 0.00 


only solar SSP 


0.00 


-0.09 to 0.03 


0.00 


0.00 to 0.06 


0.00 


-0.07 to 0.03 


0.00 


0.00 to 0.00 


wide Kroupa 


0.04 


-0.08 to 0.20 


0.00 


0.00 to 0.07 


-0.11 


-0.18 to 0.00 


0.00 


0.00 to 0.00 


wide Chabrier 


0.04 


-0.15 to 0.18 


0.00 


0.00 to 0.07 


-0.12 


-0.22 to -0.03 


0.00 


0.00 to 0.00 


wide top-heavy 


0.08 


-0.12 to 0.30 


0.00 


0.00 to 0.13 


1.20 


1.02 to 1.46 


0.00 


0.00 to 0.00 


Z=0.004 


0.31 


0.08 to 0.56 


0.19 


0.13 to 0.26 


0.31 


0.17 to 0.47 


0.00 


0.00 to 0.00 


Z=0.04 


-0.34 


-0.55 to -0.26 


0.00 


0.00 to 0.00 


-0.27 


-0.44 to -0.18 


0.00 


0.00 to 0.00 


UBVRI-I-IRAC 


0.04 


-0.12 to 0.20 


0.00 


0.00 to 0.07 


0.03 


-0.06 to 0.17 


0.00 


0.00 to 0.00 


UBVRIJHK 


0.03 


-0.07 to 0.23 


0.00 


0.00 to 0.07 


0.02 


-0.06 to 0.15 


0.00 


0.00 to 0.00 


ugriz 


-0.04 


-1.50 to 0.10 


0.00 


0.00 to 0.84 


-0.01 


-0.14 to 0.14 


0.00 


0.00 to 0.00 


RIJHK-I-IRAC 


-0.13 


-2.36 to 0.11 


0.07 


0.00 to 0.71 


-0.14 


-1.16 to 0.07 


0.00 


0.00 to 2.60 


BRIK 


-0.07 


-1.39 to 0.18 


0.07 


0.00 to 0.59 


-0.04 


-0.46 to 0.12 


0.00 


0.00 to 0.00 


wide BC03 


0.00 


-0.08 to 0.21 


0.00 


0.00 to 0.13 


0.07 


-0.01 to 0.22 


0.00 


0.00 to 0.00 



are underestimated. The median offsets in stellar mass differ 
by -^ 0.3 dex between the two different stellar population 
models. 

Importantly here BC03 templates show the same behaviour 
as M05 templates for M05 galaxies (compare to Fig. 1111) . 
namely the mass is underestimated. This shows again 
that when fitting M05-type galaxies with BC03 templates 
the correct determination of the mass is just due to a 



good concatenation of compensating effects, namely the 

determination of an older age due to TP-AGB effects, which 

is by good chance similar to the age of the underlying older 

population. 

Reddening and SFRs are comparable for M05 and BC03 

templates, independently of the flavour of the input galax- 
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Table B4. Same as Table |B3| for mock passive galaxies at 2 = 2.0. 



setup 


A log age 




A E{B-V) 




A log M* 




A log SFR 




z=2, no reddening 


median 


range 


median 


range 


median 


range 


median 


range 


wide Salpeter 


0.01 


0.00 to 0.19 


0.00 


0.00 to 0.00 


-0.01 


-0.02 to 0.00 


0.00 


0.00 to 0.00 


only-T 


0.01 


0.00 to 0.03 


0.00 


0.00 to 0.00 


-0.02 


-0.05 to 0.00 


0.00 


0.00 to 0.00 


only SSPs 


0.01 


0.00 to 0.17 


0.00 


0.00 to 0.00 


0.00 


-0.03 to 0.03 


0.00 


0.00 to 0.00 


only solar SSP 


0.00 


-0.02 to 0.01 


0.00 


0.00 to 0.00 


-0.02 


-0.07 to 0.00 


0.00 


0.00 to 0.00 


wide Kroupa 


0.01 


0.00 to 0.17 


0.00 


0.00 to 0.00 


-0.14 


-0.15 to -0.12 


0.00 


0.00 to 0.00 


wide Cliabrier 


0.01 


0.00 to 0.17 


0.00 


0.00 to 0.00 


-0.17 


-0.18 to -0.14 


0.00 


0.00 to 0.00 


wide top-heavy 


0.03 


0.01 to 0.17 


0.00 


0.00 to 0.00 


1.00 


0.87 to 1.07 


0.00 


0.00 to 0.00 


Z=0.004 


0.30 


0.18 to 0.48 


0.00 


0.00 to 0.00 


-1.51 


-2.19 to -0.48 


0.00 


0.00 to 0.00 


Z=0.04 


-0.27 


-0.29 to -0.24 


0.00 


0.00 to 0.00 


-0.25 


-0.31 to -0.19 


0.00 


0.00 to 0.00 


UBVRI+IRAC 


0.01 


0.00 to 0.15 


0.00 


0.00 to 0.00 


-0.01 


-0.05 to 0.04 


0.00 


0.00 to 0.00 


UBVRIJHK 


0.01 


0.00 to 0.17 


0.00 


0.00 to 0.00 


0.00 


-0.04 to 0.03 


0.00 


0.00 to 0.00 


ugriz 


0.01 


-0.09 to 0.17 


0.00 


0.00 to 0.00 


0.01 


-0.06 to 0.07 


0.00 


0.00 to 0.00 


RIJHK+IRAC 


0.14 


0.00 to 0.30 


0.00 


0.00 to 0.00 


0.05 


-0.07 to 0.16 


0.00 


0.00 to 2.60 


BRIK 


0.00 


-0.27 to 0.01 


0.00 


0.00 to 0.00 


-0.04 


-0.15 to 0.01 


0.00 


0.00 to 0.00 


wide BC03 


-0.42 


-0.54 to -0.24 


0.00 


0.00 to 0.00 


-3.83 


-4.49 to -2.89 


0.00 


0.00 to 0.00 


z=2, with reddening 


wide Salpeter 


0.01 


0.00 to 0.19 


0.00 


0.00 to 0.00 


-0.01 


-0.02 to 0.02 


0.00 


0.00 to 0.00 


only-T 


0.01 


-0.02 to 0.03 


0.00 


0.00 to 0.05 


-0.01 


-0.02 to 0.01 


0.00 


0.00 to 0.00 


only SSPs 


0.01 


0.00 to 0.17 


0.00 


0.00 to 0.00 


0.00 


-0.02 to 0.03 


0.00 


0.00 to 0.00 


only solar SSP 


0.00 


-0.02 to 0.01 


0.00 


0.00 to 0.00 


-0.02 


-0.07 to 0.00 


0.00 


0.00 to 0.00 


wide Kroupa 


0.01 


0.00 to 0.05 


0.00 


0.00 to 0.00 


-0.14 


-0.15 to -0.11 


0.00 


0.00 to 0.00 


wide Chabrier 


0.01 


0.00 to 0.17 


0.00 


0.00 to 0.00 


-0.17 


-0.18 to -0.14 


0.00 


0.00 to 0.00 


wide top-heavy 


0.01 


0.00 to 0.17 


0.00 


0.00 to 0.00 


1.00 


0.87 to 1.06 


0.00 


0.00 to 0.00 


Z=0.004 


0.30 


0.18 to 0.48 


0.37 


0.22 to 0.44 


0.42 


0.32 to 0.53 


0.00 


0.00 to 0.00 


Z=0.04 


-0.27 


-0.29 to -0.24 


0.00 


0.00 to 0.00 


-0.25 


-0.30 to -0.19 


0.00 


0.00 to 0.00 


UBVRI-I-IRAC 


0.01 


0.00 to 0.15 


0.00 


0.00 to 0.00 


-0.01 


-0.05 to 0.04 


0.00 


0.00 to 0.00 


UBVRIJHK 


0.01 


-0.02 to 0.15 


0.00 


0.00 to 0.05 


0.02 


-0.01 to 0.07 


0.00 


0.00 to 0.00 


ugriz 


0.00 


-0.24 to 0.12 


0.00 


0.00 to 0.06 


0.02 


-0.06 to 0.12 


0.00 


0.00 to 0.00 


RIJHK-I-IRAC 


0.18 


-0.04 to 0.30 


0.00 


0.00 to 0.26 


0.07 


-0.07 to 0.21 


0.00 


0.00 to 2.30 


BRIK 


0.00 


-0.27 to 0.11 


0.00 


0.00 to 0.05 


-0.03 


-0.14 to 0.02 


0.00 


0.00 to 0.00 


wide BC03 


-0.42 


-0.49 to -0.19 


0.59 


0.44 to 0.66 


0.13 


0.09 to 0.29 


0.00 


0.00 to 0.00 



APPENDIX E: COMPARISON WITH WUYTS 
ET AL. (2009) 

For a direct comparison with Wuyts et al. (2009) we show 
here our version of their Fig. 8 in which we swapped the 
Av row for a row showing the metallicity (Fig. lEl |). We also 
list median values with 68% confidence levels of the differ- 
ence between derived and true age, mass, SFR and E(B-V), 
together with those of Wuyts et al. (2009) in Table [El] 
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Table Dl. Same as Table [BT] for mock star-forming galaxies at z = 0.5 and z = 2.0 based on Pegase photometry. 



setup 


A log age 




A E(B-V) 




A log M* 






A log SFR 




z=0.5, no reddening 


median 


range 


median 


range 


median 


range 




median 


range 


wide M05 


-0.73 


-0.97 to -0.11 


0.00 


0.00 to 0.00 


-0.53 


-0.68 to 


-0.21 


-0.16 


-2.76 to 0.13 


wide BC03 


-0.30 


-0.54 to 0.16 


0.00 


0.00 to 0.00 


-0.28 


-0.45 to 


-0.06 


-0.02 


-2.00 to 0.08 


z=0.5, with reddening 


wide M05 


-2.49 


-2.91 to -0.88 


0.29 


0.05 to 0.64 


-0.85 


-1.51 to 


-0.49 


-2.88 


-3.41 to 0.39 


wide BC03 


-1.00 


-2.96 to -0.20 


0.15 


0.00 to 0.64 


-0.49 


-1.13 to 


-0.20 


-0.70 


-3.26 to 0.30 


z=2, no reddening 


wide M05 


-0.52 


-1.05 to -0.33 


0.00 


0.00 to 0.00 


-0.35 


-0.63 to 


-0.24 


-0.01 


-3.45 to 0.14 


wide BC03 


-0.22 


-1.21 to 0.13 


0.00 


0.00 to 0.00 


-0.20 


-0.78 to 


-0.08 


-0.03 


-3.26 to 0.08 


z=2, with reddening 


wide M05 


-0.89 


-1.77 to -0.41 


0.06 


0.00 to 0.13 


-0.52 


-1.02 to 


-0.28 


0.09 


-3.89 to 0.72 


wide BC03 


-0.36 


-1.46 to 0.03 


0.05 


-0.02 to 0.13 


-0.25 


-0.81 to 


-0.09 


0.01 


-3.64 to 0.64 
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Figure C4. Same as Fig. ICll for the comparison of derived stel- 
lar masses. Black refers to the unreddened case, red to the case 
including reddening. 



Figure C5. Same as Fig. ICll for the comparison of derived star 
formation rates. Black refers to the unreddened case, red to the 
case including reddening. 
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Figure Dl. Ages derived from SED-fitting with different stellar population synthesis models (M05 and BC03) for mock star-forming 
galaxies based on Pegase stellar evolution as a function of the age of oldest SSP in the galaxy (left panels) and mass- weighted ages (right 
panels). Shown are the cases for z=0.5 (upper part) and z=2 (lower part). A wide setup and wavelength coverage was used for both, 
M05 and BC03. Black refers to the unreddened case, red to the case including reddening. 
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Figure D2. Same as in the previous figure for stellar masses. Quality factors are given for the entire mass range. 
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Figure D4. Same as Fie:. ID2l for the comparison of derived star formation rates. 
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Figure El. Basic correlations between age, dust reddening, stellar mass and SFR derived from SED-fitting using a wide setup and a 
wavelength coverage of UBVRIJHK+IRAC bands in the reddened case. A is the difference between estimated and true quantity. For 
stellar mass, age and SFR we used the logarithmic values. The true age for the mock galaxies is represented by the mass-weighted age. 
Negative values of A mean underestimation, positive values stand for overestimation. We show objects at z = 0.5 in black, at z = 1 in 
magenta, 2 = 2 in green and at z = 3 in yellow. - Reproduced from lWuvts et al.l 1 120091 ) with added metallicity row and omitted Ay row. 
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Table El. Comparison of our SED-fitting performance for age, reddening, stellar mass and SFR with lWuvts et alj ||2009| V We list 
the median (Am) values of A with their 68% intervals, respectively, for mock star-forming galaxies for 4 setups: 1) a wide setup with 
UBVRIJHK+IRAC filter bands in the reddening case, 2) same as 1) with a minimum age of 0.1 Gyr, 3) inverted-r at z=2 (age fixed 
to 2 Gyr) and z=3 (age fixed to 1.1 Gyr). For mock passive galaxies we list only the cases for the wide setup and no reddening in 
the fitting. Values of Wuyts et al. are medians with 68% intervals. A is defined as the difference between recovered and true value. 
For stellar mass, age and SFR the logarithmic values are used. The true age for the mock star-forming galaxies is represented by the 
mass-weighted age. For inverted-r models we compare to the oldest age. Ages from Wuyts et. al. refer to mass-weighted ages. Negative 
values of A mean underestimation of the according property, positive values stand for overestimation. In cases in which cither true or 
best-fit SFR equal zero, we adopt a dummy value of 0.001 Mq /yr to be able to express them in logarithm. 
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Figure D3. Same as Fig. IC3l for the comparison of derived dust 
attenuation when mock star-forming galaxies are based on Pegase 
models. Red dots show results derived with BC03-based templates 
in the fitting, black those with M05-based templates. 
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